This is a brief review on Brownian functionals in one dimension and their various applications,
a contribution to the special issue "The Legacy of Albert Einstein" of Current Science. After a 
brief description of Einstein's original derivation of the diffusion equation, this article provides a 
pedagogical introduction to the path integral methods leading to the derivation of the celebrated 
Feynman-Kac formula. The usefulness of this technique in calculating the statistical properties of 
Brownian functionals is illustrated with several examples in physics and probability theory, with 
particular emphasis on applications in computer science. The statistical properties of "first-passage 
Brownian functionals" and their applications are also discussed. 

I. INTRODUCTION 

The year 2005 marks the centenary of the publication of three remarkable papers by Einstein, one on Brownian
motion [1], one on special relativity [2] and the other one on the photoelectric effect and light quanta [3]. Each
of them made a revolution on its own. In particular, his paper on Brownian motion (along with the related work
by Smoluchowski [4] and Langevin [5]) had a more sustained and broader impact, not just in traditional 'natural'
sciences such as physics, astronomy, chemistry, biology and mathematics but even in 'man-made' subjects such as
economics and computer science. The range of applications of Einstein's Brownian motion and his theory of diffusion
is truly remarkable. The ever emerging new applications in diverse fields have made Brownian motion a true
legacy and a great gift of Einstein to science.

There have been numerous articles in the past detailing the history of Brownian motion prior to and after Einstein.
Reviewing this gigantic amount of work is beyond the scope of this article. This year two excellent reviews on
Brownian motion with its history and applications have been published, one by Frey and Kroy [6] and the other
by Duplantier [7]. The former discusses the applications of Brownian motion in soft matter and biological physics
and the latter, after a very nice historical review, discusses the applications of Brownian motion in a variety of two
dimensional growth problems and their connections to conformal field theory. Apart from these two reviews, there
have been numerous other recent reviews on the 100 years of Brownian motion [8]; it is simply not possible to cite
all of them within the limited scope of this article and I apologise for that. The purpose of the present article is to
discuss some complementary aspects of Brownian motion that are not covered by the recent reviews mentioned above.

After a brief introduction to Einstein's original derivation of the Stokes-Einstein relation and the diffusion equation
in Section II, the principal focus of the rest of the article will be on the statistical properties of functionals of
one dimensional Brownian motion, with special emphasis on their applications in physics and computer science.
If $x(\tau)$ represents a Brownian motion, a Brownian functional over a fixed time interval $[0, t]$ is simply defined as
$T = \int_0^t U(x(\tau))\, d\tau$, where $U(x)$ is some prescribed arbitrary function. For each realization of the Brownian path, the
quantity $T$ has a different value and one is interested in the probability density function (pdf) of $T$. It was Kac who
first realized [9] that the statistical properties of one dimensional Brownian functionals can be studied by cleverly
using the path integral method devised by Feynman in his unpublished Ph.D. thesis at Princeton. This observation
of Kac thus took Einstein's classical diffusion process into yet another completely different domain of physics, namely
quantum mechanics, and led to the discovery of the celebrated Feynman-Kac formula. Since then Brownian
functionals have found numerous applications in diverse fields ranging from probability theory [10] and finance [11]
to disordered systems and mesoscopic physics [12]. In this article I will discuss some of them, along with some recent
applications of Brownian functionals in computer science.

After a brief and pedagogical derivation of the path integral methods leading to the Feynman-Kac formula in Section
III, I will discuss several applications from physics, computer science and graph theory in Section IV. In Section V, the
statistical properties of "first-passage Brownian functionals" will be discussed. A first-passage functional is defined
as $T = \int_0^{t_f} U(x(\tau))\, d\tau$ where $t_f$ is the first-passage time of the Brownian process $x(\tau)$, i.e. the first time the process
crosses zero. Such first-passage functionals have many applications, e.g. in the distribution of lifetimes of comets, in
queueing theory and also in transport properties in disordered systems. Some of these applications will be discussed
in Section V.

The diverse and ever emerging new applications of Brownian functionals briefly presented here will hopefully 
convince the reader that 'Brownian functionalogy' merits the status of a subfield of statistical physics (and stochastic 
calculus) itself and is certainly a part of the legacy that Einstein left behind. 
II. EINSTEIN'S THEORY OF BROWNIAN MOTION AND LANGEVIN'S STOCHASTIC EQUATION

Einstein's 1905 paper on Brownian motion [1] achieved two important milestones: (i) it related macroscopic kinetic
parameters such as the diffusion constant and friction coefficient to the correlation functions characterizing fluctuations
of microscopic variables, a result known as a fluctuation-dissipation relation, and (ii) it provided a derivation of the celebrated
diffusion equation starting from the microscopic irregular motion of a particle, thus laying the foundation of the
important field of "stochastic processes".



A. A fluctuation-dissipation relation 



Very briefly, Einstein's argument leading to the derivation of the fluctuation-dissipation relation goes as follows. Imagine
a dilute gas of noninteracting Brownian particles in a solvent under a constant volume force $K$ (such as gravity) on
each particle. For simplicity, we consider a one dimensional system here, though the arguments can be generalized
straightforwardly to higher dimensions. There are two steps to the argument. The first step is to assume that the
dilute gas of Brownian particles suspended in a solvent behaves as an ideal gas and hence exerts an osmotic pressure
on the container giving rise to a pressure field. The pressure $p(x)$ at point $x$ is related to the density $\rho(x)$ via the
equation of state for an ideal gas: $p(x) = k_B T \rho(x)$, where $k_B$ is Boltzmann's constant and $T$ is the temperature.
The force per unit volume due to the pressure field, $-\partial_x p(x)$, must be balanced at equilibrium by the net external force
density $K\rho(x)$, leading to the force balance condition: $K\rho(x) = -\partial_x p(x) = -k_B T\, \partial_x \rho(x)$. The solution is simply

$$\rho(x) = \rho(0)\, \exp\left(-\frac{K}{k_B T}\, x\right). \tag{1}$$

The next step of the argument consists of identifying two currents in the system. The first is the diffusion current
$j_{\rm diff} = -D\, \partial_x \rho(x)$ where $D$ is defined as the diffusion coefficient. The second is the drift current due to the external
force, $j_{\rm drift}$, which can be computed as follows. Under a constant external force, each particle achieves at long times
a terminal drift velocity of magnitude $v = K/\Gamma$ where $\Gamma$ is the friction coefficient. For spherical particles of radius $a$, $\Gamma$ is given by
Stokes' formula, $\Gamma = 6\pi\eta a$, where $\eta$ is the viscosity. In the sign convention of the force balance condition above, the force
pushes the particles towards decreasing $x$, so that $j_{\rm drift} = -v\rho(x) = -K\rho(x)/\Gamma$. Now, at equilibrium, the net
current in a closed system must be zero, $j = j_{\rm diff} + j_{\rm drift} = 0$, leading to the equation $-D\, \partial_x \rho(x) - K\rho(x)/\Gamma = 0$. The
solution is

$$\rho(x) = \rho(0)\, \exp\left(-\frac{K}{D\Gamma}\, x\right). \tag{2}$$
Comparing Eqs. (1) and (2), Einstein obtained the important relation

$$D = \frac{k_B T}{\Gamma}, \tag{3}$$

which is known today as the Stokes-Einstein relation; it connects macroscopic kinetic coefficients such as $D$ and $\Gamma$
to the thermal fluctuations characterized by the temperature $T$.
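To get a feeling for the orders of magnitude in Eq. (3), the following minimal sketch evaluates $D = k_B T/(6\pi\eta a)$ for one illustrative set of inputs. The particle radius, viscosity and temperature are assumptions chosen for illustration (roughly a half-micron sphere in water at room temperature) and do not come from the text; the function name `stokes_einstein_D` is likewise hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)

def stokes_einstein_D(temperature_K, viscosity_Pa_s, radius_m):
    """Diffusion constant D = k_B T / Gamma, with Stokes friction Gamma = 6 pi eta a."""
    gamma = 6.0 * math.pi * viscosity_Pa_s * radius_m
    return K_B * temperature_K / gamma

# Assumed inputs: sphere of radius 0.5 micron, viscosity 1e-3 Pa s, T = 293 K.
D = stokes_einstein_D(293.0, 1.0e-3, 0.5e-6)

# In one dimension the mean square displacement grows as 2 D t.
rms_1s = math.sqrt(2.0 * D * 1.0)
print(f"D = {D:.3e} m^2/s, rms displacement after 1 s = {rms_1s:.3e} m")
```

With these assumed inputs $D$ comes out at a few times $10^{-13}$ m$^2$/s, i.e. the particle wanders roughly a micron per second, which is why Brownian motion is visible under an optical microscope.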



B. Diffusion as a microscopic process 



In addition to the fluctuation-dissipation relation in Eq. (3), Einstein's 1905 paper on Brownian motion also provided
an elegant derivation of the diffusion equation that expressed the diffusion constant $D$ in terms of microscopic fluctuations.
Since the particles are independent, the density $\rho(x, t)$ can also be interpreted as the probability $\rho(x, t) = P(x, t)$
that a single Brownian particle is at position $x$ at time $t$ and the aim is to derive an evolution equation for $P(x, t)$
by following the trajectory of a single particle. Here one assumes that the particle is free, i.e. not subjected to any
external drift. Einstein considered the particle at position $x$ at time $t$ and assumed that in a microscopic time step
$\Delta t$, the particle jumps by a random amount $\Delta x$ which is thus a stochastic variable. He then wrote down an evolution
equation for $P(x, t)$

$$P(x, t + \Delta t) = \int_{-\infty}^{\infty} P(x - \Delta x, t)\, \phi_{\Delta t}(\Delta x)\, d(\Delta x) \tag{4}$$

where $\phi_{\Delta t}(\Delta x)$ is the normalized probability density of the 'jump' $\Delta x$ in time step $\Delta t$. This evolution equation
is known today as the Chapman-Kolmogorov equation and it inherently assumes that the stochastic process $x(t)$ is
Markovian. This means that the jump variables $\Delta x$ are independent from step to step, so that the position $x(t)$
of the particle at a given time step depends only on its position at the previous time step and not on the full previous history of
evolution. Next Einstein assumed that $P(x - \Delta x, t)$ in the integrand in Eq. (4) can be Taylor expanded assuming
'small' $\Delta x$. This gives

$$P(x, t + \Delta t) = P(x, t) - \mu_1 \frac{\partial P}{\partial x} + \frac{\mu_2}{2!} \frac{\partial^2 P}{\partial x^2} + \ldots \tag{5}$$

where $\mu_k = \int_{-\infty}^{\infty} (\Delta x)^k\, \phi_{\Delta t}(\Delta x)\, d(\Delta x)$ is the $k$-th moment of the jump variable $\Delta x$. Furthermore, the absence of
external drift sets $\mu_1 = 0$. Moving $P(x, t)$ to the left hand side of Eq. (5), dividing both sides by $\Delta t$, taking the limit $\Delta t \to 0$ and keeping only the leading
nonzero term (assuming the higher order terms vanish as $\Delta t \to 0$) one gets the diffusion equation

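$$\frac{\partial P}{\partial t} = D\, \frac{\partial^2 P}{\partial x^2} \tag{6}$$

where the diffusion constant

$$D = \lim_{\Delta t \to 0} \frac{\mu_2}{2 \Delta t} = \lim_{\Delta t \to 0} \frac{1}{2 \Delta t} \int_{-\infty}^{\infty} (\Delta x)^2\, \phi_{\Delta t}(\Delta x)\, d(\Delta x) = \lim_{\Delta t \to 0} \frac{\langle (\Delta x)^2 \rangle}{2 \Delta t} \tag{7}$$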

where $\langle (\Delta x)^2 \rangle$ is the average of the square of the microscopic displacement in a microscopic time step $\Delta t$. Thus Einstein
was able to express the constant $D$ that appears as a coefficient in the macroscopic diffusion current $j_{\rm diff} = -D\, \partial_x P$ in
terms of the microscopic fluctuation $\Delta x$ in the position of a Brownian particle. This derivation also brings out the
fundamental principle of the diffusion process, i.e. the length scale must scale as the square root of the time scale.
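The chain of reasoning from Eq. (4) to Eq. (7) can be checked on a lattice. The sketch below iterates the Chapman-Kolmogorov equation (4) exactly for an assumed symmetric jump law (jumps of $-1$, $0$, $+1$ lattice units with probabilities $1/4$, $1/2$, $1/4$ per unit time step; this particular choice is only an illustration) and compares the variance of $P(x,t)$ with the diffusive prediction $2Dt$, where $D$ is computed from Eq. (7).

```python
def evolve(P, phi):
    """One step of Eq. (4) on a lattice: P_new[x] = sum over dx of P[x - dx] * phi[dx]."""
    new = {}
    for x, px in P.items():
        for dx, w in phi.items():
            new[x + dx] = new.get(x + dx, 0.0) + px * w
    return new

# Assumed jump distribution (zero mean, so there is no drift) with time step dt = 1.
phi = {-1: 0.25, 0: 0.5, 1: 0.25}
mu2 = sum(dx * dx * w for dx, w in phi.items())  # second moment of the jump
D = mu2 / 2.0                                    # Eq. (7) with dt = 1

P = {0: 1.0}      # particle starts at the origin with probability one
n_steps = 200
for _ in range(n_steps):
    P = evolve(P, phi)

norm = sum(P.values())
variance = sum(x * x * p for x, p in P.items())
print("normalization:", norm)
print("variance:", variance, " prediction 2*D*t:", 2.0 * D * n_steps)
```

The variance grows linearly in the number of steps, so the typical displacement grows as the square root of time, which is the scaling stated above.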

The position of the Brownian particle can evolve via many possible 'stochastic' trajectories. The diffusion equation
(6) describing the evolution of the probability density sums up the effects of all underlying stochastic trajectories.
However, it is often useful to have a mathematical description of each single trajectory. This brings us to the
description of the diffusion process à la Langevin [5]. It is clear from Einstein's derivation that the local slope of an
evolving trajectory at time $t$ can be written as

$$\frac{\Delta x}{\Delta t} = \xi_{\Delta t}(t) \tag{8}$$

where $\xi_{\Delta t}(t)$ is a random 'noise' which is independent from one microscopic step to another, and it has zero mean.
Its variance at a given time $t$, in the continuum limit $\Delta t \to 0$, can also be computed from Eq. (7). One gets
$\langle \xi_{\Delta t}^2(t) \rangle = \langle (\Delta x)^2 \rangle / (\Delta t)^2 = 2D/\Delta t$ as $\Delta t \to 0$. Thus the noise term typically scales as $1/\sqrt{\Delta t}$ as $\Delta t \to 0$. The
correlation function of the noise between two different times can then be written as

$$\langle \xi_{\Delta t}(t)\, \xi_{\Delta t}(t') \rangle = \begin{cases} 0 & \text{if } t \neq t' \\ \dfrac{2D}{\Delta t} & \text{if } t = t' \end{cases} \tag{9}$$

In the continuum limit $\Delta t \to 0$, the noise $\xi_{\Delta t}(t)$ then tends to a limiting noise $\xi(t)$ which has zero mean and a
correlator, $\langle \xi(t)\, \xi(t') \rangle = 2D\, \delta(t - t')$. This last result follows by formally taking the limit $\Delta t \to 0$ in Eq. (9) where,
loosely speaking, one replaces the $1/\Delta t$ by $\delta(0)$. Such a noise is called a 'white' noise. Thus, in the continuum limit
$\Delta t \to 0$, Eq. (8) reduces to the celebrated Langevin equation,

$$\frac{dx}{dt} = \xi(t) \tag{10}$$

where $\xi(t)$ is a white noise. Moreover, in the continuum limit $\Delta t \to 0$, one can assume, without any loss of generality,
that the white noise $\xi(t)$ is Gaussian. This means that the joint probability distribution of a particular history of the
noise variables $[\{\xi(\tau)\}, \text{ for } 0 \le \tau \le t]$ can be written as

$$\text{Prob}\left[\{\xi(\tau)\}\right] \propto \exp\left[-\frac{1}{4D} \int_0^t \xi^2(\tau)\, d\tau\right] \tag{11}$$



We will see later that this particular fact plays the key role in the representation of Brownian motion as a path
integral. The Brownian motion $x(t)$ can thus be represented as the integrated white noise, $x(t) = x(0) + \int_0^t \xi(\tau)\, d\tau$.
While physicists call this a Brownian motion, mathematicians call this integrated white noise the Wiener
process, named after the mathematician N. Wiener.

Langevin's formulation in Eq. (10) also makes a correspondence between Brownian motion and the random walk
problem where the position $x_n$ of a random walker after $n$ steps evolves via

$$x_n = x_{n-1} + \xi_n \tag{12}$$

where the $\xi_n$'s are independent random variables, each drawn from the common distribution $\phi(\xi)$ for each step $n$. In fact,
the idea of understanding Brownian motion in terms of random walks was first conceived by Smoluchowski [4]. The
Langevin equation representation of Brownian motion makes this connection evident: the Brownian motion is just the
suitably taken continuum limit of the random walk problem. For large $n$, by virtue of the central limit theorem, the
results for the random walk problem reduce to those of the Brownian motion. This is an important point because in
many applications, especially those in computer science as will be discussed later, one often encounters discrete random
walks as in Eq. (12) which are often more difficult to solve than the continuum Brownian motion. However, since in
most applications one is typically interested in the large time scaling-limit results, one can correctly approximate a
discrete random walk sequence by the continuum Brownian process and this makes life much simpler.
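The correspondence between the Langevin equation (10) and the random walk (12) can be made concrete by discretizing time. The following sketch is a simple Euler discretization under stated assumptions (Gaussian noise, a fixed step `dt`, and hypothetical names such as `brownian_endpoints`): at each step it draws a noise of zero mean and variance $2D/\Delta t$, as in Eq. (9), and adds $\xi\, \Delta t$ to the position. The sample variance of the end points should then be close to $2Dt$.

```python
import math
import random

def brownian_endpoints(D, t, dt, n_paths, seed=0):
    """Integrate dx/dt = xi(t) with discretized noise of variance 2D/dt (Eq. (9)).

    Each increment xi*dt has variance 2*D*dt, so this is the random walk of
    Eq. (12) with Gaussian steps. Returns the list of final positions x(t).
    """
    rng = random.Random(seed)
    n_steps = int(round(t / dt))
    sigma_xi = math.sqrt(2.0 * D / dt)   # standard deviation of the noise
    ends = []
    for _ in range(n_paths):
        x = 0.0
        for _ in range(n_steps):
            x += rng.gauss(0.0, sigma_xi) * dt
        ends.append(x)
    return ends

ends = brownian_endpoints(D=0.5, t=1.0, dt=0.01, n_paths=4000)
mean = sum(ends) / len(ends)
var = sum((x - mean) ** 2 for x in ends) / len(ends)
print("sample mean:", mean)
print("sample variance:", var, " expected 2*D*t:", 1.0)
```

Note that the noise amplitude diverges as $1/\sqrt{\Delta t}$ while the increment $\xi\, \Delta t$ shrinks as $\sqrt{\Delta t}$; this is the discrete counterpart of the white noise limit described above.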



III. BROWNIAN PROCESS AS A PATH INTEGRAL 



The solution of the diffusion equation (6) can be easily obtained in free space by the Fourier transform method.
For simplicity, we set $D = 1/2$ for the rest of the article. One gets

$$P(x, t) = \int_{-\infty}^{\infty} dx_0\, G_0(x, t|x_0, 0)\, P(x_0, 0) \tag{13}$$

where $P(x_0, 0)$ is the initial condition and the diffusion propagator

$$G_0(x, t|x_0, 0) = \frac{1}{\sqrt{2\pi t}}\, \exp\left[-(x - x_0)^2/2t\right] \tag{14}$$



denotes the conditional probability that the Brownian particle reaches $x$ at time $t$, starting from $x_0$ at $t = 0$. It was M.
Kac who first made the important observation [9] that this diffusion propagator can be interpreted, using Feynman's
path integral formalism, as the quantum propagator of a free particle from time $0$ to time $t$. This is easy to see. Using
the property of the Gaussian noise in Eq. (11) and the Langevin equation (10), it is clear that the probability of any
path $\{x(\tau)\}$ can be written as

$$P\left[\{x(\tau)\}\right] \propto \exp\left[-\frac{1}{2} \int_0^t \left(\frac{dx}{d\tau}\right)^2 d\tau\right] \tag{15}$$



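Thus the diffusion propagator, i.e. the probability that a path goes from $x_0$ at $\tau = 0$ to $x$ at $\tau = t$, can be written as a
sum over the contributions from all possible paths propagating from $x_0$ at $\tau = 0$ to $x$ at $\tau = t$. This sum is indeed
Feynman's path integral [13]

$$G_0(x, t|x_0, 0) = \int_{x(0) = x_0}^{x(t) = x} \mathcal{D}x(\tau)\, \exp\left[-\frac{1}{2} \int_0^t \left(\frac{dx}{d\tau}\right)^2 d\tau\right] \tag{16}$$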



One immediately identifies the term $\frac{1}{2} \left(\frac{dx}{d\tau}\right)^2$ as the classical kinetic energy, and hence the Lagrangian, of a free particle
of unit mass, and the integral $\frac{1}{2} \int_0^t \left(\frac{dx}{d\tau}\right)^2 d\tau$ as the corresponding action. Following Feynman [13], one then identifies the path
integral in Eq. (16) as a quantum propagator

$$G_0(x, t|x_0, 0) = \langle x|\, e^{-H_0 t}\, |x_0 \rangle \tag{17}$$



where $H_0 = -\frac{1}{2} \frac{\partial^2}{\partial x^2}$ is the quantum Hamiltonian of a free particle (we have set the mass $m = 1$ and Planck's
constant $\hbar = 1$). To make the connection complete, the quantum propagator on the r.h.s of Eq. (17) can be easily
evaluated by expanding it in the free particle eigenbasis. Noting that $H_0$ has free particle eigenfunctions $\psi_k(x) = \frac{1}{\sqrt{2\pi}}\, e^{ikx}$
with eigenvalue $k^2/2$, one gets

$$G_0(x, t|x_0, 0) = \langle x|\, e^{-H_0 t}\, |x_0 \rangle = \int_{-\infty}^{\infty} \langle x|k \rangle \langle k|x_0 \rangle\, e^{-k^2 t/2}\, dk = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{ik(x - x_0) - k^2 t/2}\, dk \tag{18}$$

Performing the Gaussian integration, one gets back the classical result in Eq. (14) that was obtained by solving the
diffusion equation. Thus the two approaches, one by solving a partial differential equation, usually referred to as the
Fokker-Planck approach, and the other using the path integral method, are completely equivalent.
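The equivalence of Eqs. (14) and (18) can also be confirmed numerically. The sketch below evaluates the Fourier integral in Eq. (18) with a plain trapezoidal rule and compares it with the Gaussian of Eq. (14). The cutoff `k_max` and the number of grid points `n` are arbitrary choices made for this illustration, and the function names are hypothetical.

```python
import math

def propagator_exact(x, t, x0=0.0):
    """Gaussian diffusion propagator of Eq. (14), with D = 1/2."""
    return math.exp(-(x - x0) ** 2 / (2.0 * t)) / math.sqrt(2.0 * math.pi * t)

def propagator_fourier(x, t, x0=0.0, k_max=40.0, n=20000):
    """Trapezoidal estimate of (1/2pi) * integral of exp(ik(x-x0) - k^2 t/2) dk.

    The imaginary (sine) part is odd in k and integrates to zero,
    so only the cosine part is summed.
    """
    h = 2.0 * k_max / n
    total = 0.0
    for j in range(n + 1):
        k = -k_max + j * h
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * math.cos(k * (x - x0)) * math.exp(-0.5 * k * k * t)
    return total * h / (2.0 * math.pi)

for (x, t, x0) in [(0.7, 0.5, 0.0), (1.2, 2.0, -0.3)]:
    print(x, t, x0, propagator_fourier(x, t, x0), propagator_exact(x, t, x0))
```

The two columns agree to many digits because the integrand is a smooth Gaussian in $k$, for which the trapezoidal rule converges very rapidly.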

One may argue that once the basic propagator is known, the Brownian motion is well understood and there is
nothing else interesting left to study! This is simply not true because there are intricate questions associated with the
diffusion process that are often rather nontrivial. A notable nontrivial example is the calculation of the persistence
exponent associated with a diffusion process [14]. Consider a diffusive field $\phi(\vec{r}, t)$ evolving via the $d$-dimensional
diffusion equation

$$\frac{\partial \phi}{\partial t} = \frac{1}{2} \nabla^2 \phi \tag{19}$$



starting from the initial condition $\phi(\vec{r}, 0)$ which is a random Gaussian field, uncorrelated in space. The solution at
time $t$ can be easily found using the trivial $d$-dimensional generalization of the diffusion propagator in Eq. (14)

$$\phi(\vec{r}, t) = \frac{1}{(2\pi t)^{d/2}} \int d\vec{r}_0\, \phi(\vec{r}_0, 0)\, \exp\left[-(\vec{r} - \vec{r}_0)^2/2t\right] \tag{20}$$



Now, suppose that we fix a point $\vec{r}$ in space and monitor the field $\phi(\vec{r}, t)$ there as a function of time $t$ and ask: what
is the probability $P(t)$ that the field $\phi(\vec{r}, t)$ at $\vec{r}$ does not change sign up to time $t$ starting initially at the random
value $\phi(\vec{r}, 0)$? By translational invariance, $P(t)$ does not depend on the position $\vec{r}$. This probability $P(t)$ is called
the persistence probability that has generated a lot of interest over the last decade in the context of nonequilibrium
systems ^ij. For the simple diffusion process in Eq. (19), it is known, both theoretically [l^ and experimentally 0]
that at late times $t$, the persistence $P(t)$ has a power law tail $P(t) \sim t^{-\theta}$ where the persistence exponent $\theta$ is
nontrivial (even in one dimension!), e.g., $\theta \approx 0.1207$ in $d = 1$, $\theta \approx 0.1875$ in $d = 2$, $\theta \approx 0.2380$ in $d = 3$ etc. While this
exponent $\theta$ is known numerically very precisely and also very accurately by approximate analytical methods [15], an
exact calculation of $\theta$ has not yet been achieved and it remains an outstanding unsolved problem for the diffusion
process ^3]. This example thus clarifies that while the knowledge of the diffusion propagator is necessary, it is by no
means sufficient to answer more detailed history related questions associated with the diffusion process.

Note that in the persistence problem discussed above, the relevant stochastic process at a fixed point $\vec{r}$ in space,
whose properties one is interested in, is actually a more complex non-Markovian process [14] even though it originated
from a simple diffusion equation. In this article, we will stay with our simple Brownian motion in Eq. (10) which is
a Markov process and discuss some of the nontrivial aspects of this simple Brownian motion. For example, in many
applications of Brownian motion in physics, finance and computer science, the relevant Brownian process is often
constrained. An important issue is the first-passage property of a Brownian motion [18, 19, 20], i.e. the
distribution of the first time that a Brownian process crosses the origin. For this, one needs to sample only the subset
of all possible Brownian paths that do not cross the origin up to a certain time. This can be achieved by imposing the
constraint of no crossing on a Brownian path. Apart from the constrained Brownian motion, some other applications
require a knowledge of the statistical properties of a Brownian functional up to time $t$, defined as $T = \int_0^t U(x(\tau))\, d\tau$,
where $U(x)$ is a specified function. We will provide several examples later and will see that while the properties of
a free Brownian motion are rather simple and are essentially encoded in its propagator in Eq. (14), properties of
constrained Brownian motion or that of a Brownian functional are often nontrivial to derive and the path integral
technique discussed above is particularly suitable to address some of these issues.

A. Brownian motion with constraints: first-passage property 

As a simple example of a constrained Brownian motion, we calculate in this subsection the first-passage probability
density $f(x_0, t)$. The quantity $f(x_0, t)\, dt$ is simply the probability that a Brownian path, starting at $x_0$ at $t = 0$, will
cross the origin for the first time between time $t$ and $t + dt$. Clearly, $f(x_0, t) = -\partial q(x_0, t)/\partial t$ where $q(x_0, t)$ is the
probability that the path starting at $x_0$ at $t = 0$ does not cross the origin up to $t$. The probability $q(x_0, t)$ can be
easily expressed in terms of a path integral

$$q(x_0, t) = \int_0^{\infty} dx \int_{x(0) = x_0}^{x(t) = x} \mathcal{D}x(\tau)\, \exp\left[-\frac{1}{2} \int_0^t \left(\frac{dx}{d\tau}\right)^2 d\tau\right] \prod_{\tau = 0}^{t} \theta\left[x(\tau)\right] \tag{21}$$



where the paths propagate from the initial position $x(0) = x_0$ to the final position $x$ at time $t$ and then we integrate
$x$ over only the positive half-space since the final position $x$ can only be positive. The term $\prod_{\tau = 0}^{t} \theta[x(\tau)]$ inside the
path integral is an indicator function that enforces the constraint that the path stays above the origin up to $t$. We
then identify the path integral in Eq. (21) as an integral over a quantum propagator,

$$q(x_0, t) = \int_0^{\infty} dx\, G(x, t|x_0, 0); \qquad G(x, t|x_0, 0) = \langle x|\, e^{-H_1 t}\, |x_0 \rangle \tag{22}$$

where the Hamiltonian $H_1 = -\frac{1}{2} \frac{\partial^2}{\partial x^2} + V(x)$ with the quantum potential $V(x) = 0$ if $x > 0$ and $V(x) = \infty$ if $x \le 0$.
The infinite potential for $x \le 0$ takes care of the constraint that the path cannot cross the origin, i.e. it enforces the
condition $\prod_{\tau = 0}^{t} \theta[x(\tau)]$. The eigenfunction of $H_1$ must vanish at $x = 0$, but for $x > 0$ it corresponds to that of a free
particle. The correctly normalized eigenfunctions are thus $\psi_k(x) = \sqrt{\frac{2}{\pi}}\, \sin(kx)$ with $k \ge 0$, with eigenvalues $k^2/2$.
The quantum propagator can then be evaluated again by decomposing into the eigenbasis

$$G(x, t|x_0, 0) = \frac{2}{\pi} \int_0^{\infty} \sin(k x_0)\, \sin(k x)\, e^{-k^2 t/2}\, dk = \frac{1}{\sqrt{2\pi t}} \left[e^{-(x - x_0)^2/2t} - e^{-(x + x_0)^2/2t}\right] \tag{23}$$

Note that this result for the propagator can also be derived alternately by solving the diffusion equation with an
absorbing boundary condition at the origin. The result in Eq. (23) then follows by a simple application of the
image method [20]. Integrating over the final position $x$ one gets from Eq. (22) the classical result [T^ |,
$q(x_0, t) = \text{erf}\left(x_0/\sqrt{2t}\right)$ where $\text{erf}(z) = \frac{2}{\sqrt{\pi}} \int_0^z e^{-u^2}\, du$. The first-passage probability density is then given by

$$f(x_0, t) = -\frac{\partial q(x_0, t)}{\partial t} = \frac{x_0}{\sqrt{2\pi}}\, \frac{e^{-x_0^2/2t}}{t^{3/2}} \tag{24}$$

For $t \gg x_0^2$, one thus recovers the well known $t^{-3/2}$ decay of the first-passage probability density.
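The step from the survival probability $q(x_0, t) = \text{erf}(x_0/\sqrt{2t})$ to Eq. (24) is a one-line differentiation, and it is easy to check numerically. The sketch below compares a central finite difference of $-q$ with the closed form of Eq. (24), and also illustrates the $t^{-3/2}$ tail. The step size `h` and the sample values of $x_0$ and $t$ are arbitrary choices for this illustration.

```python
import math

def survival(x0, t):
    """q(x0, t) = erf(x0 / sqrt(2 t)): probability of no zero crossing up to t (D = 1/2)."""
    return math.erf(x0 / math.sqrt(2.0 * t))

def first_passage_density(x0, t):
    """Closed form of Eq. (24)."""
    return x0 * math.exp(-x0 * x0 / (2.0 * t)) / math.sqrt(2.0 * math.pi * t ** 3)

x0, t, h = 1.0, 2.0, 1e-5
f_numeric = -(survival(x0, t + h) - survival(x0, t - h)) / (2.0 * h)
print("finite difference:", f_numeric, " Eq. (24):", first_passage_density(x0, t))

# Late-time tail: f * t^(3/2) approaches x0 / sqrt(2 pi) when t >> x0^2.
for t_late in (1e2, 1e4, 1e6):
    print(t_late, first_passage_density(x0, t_late) * t_late ** 1.5, x0 / math.sqrt(2.0 * math.pi))
```

Because $f(x_0, t)$ decays only as $t^{-3/2}$, its first moment diverges: the mean first-passage time of a free Brownian motion to the origin is infinite even though the origin is reached with probability one.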



B. Brownian functionals: Feynman-Kac formula 

In this subsection we will discuss how to calculate the statistical properties of a Brownian functional defined as

$$T = \int_0^t U(x(\tau))\, d\tau \tag{25}$$

where $x(\tau)$ is a Brownian path starting from $x_0$ at $\tau = 0$ and propagating up to time $\tau = t$ and $U(x)$ is a specified
function. Clearly $T$ is a random variable taking different values for different Brownian paths. The goal is to calculate
its probability distribution $P(T, t|x_0)$. The choice of $U(x)$ depends on which quantity we want to calculate. Brownian
functionals appear in a wide range of problems across different fields ranging from probability theory, finance and data
analysis to disordered systems and computer science. We consider a few examples below.

1. In probability theory, an important object of interest is the occupation time, i.e. the time spent by a Brownian
motion above the origin within a time window of size $t$ [21]. The occupation time is simply $T = \int_0^t \theta[x(\tau)]\, d\tau$.
Thus, in this problem the function $U(x) = \theta(x)$.

2. For fluctuating (1 + 1)-dimensional interfaces of the Edwards-Wilkinson [22] or the Kardar-Parisi-Zhang
(KPZ) [23] varieties, the interface profile in the stationary state is described by a one dimensional Brownian
motion in space [24]. The fluctuations in the stationary state are captured by the pdf of the spatially
averaged variance of height fluctuations [25] in a finite system of size $L$, i.e. the pdf of $\sigma^2 = \frac{1}{L} \int_0^L h^2(x)\, dx$ where
$h(x)$ is the deviation of the height from its spatial average. Since $h(x)$ performs a Brownian motion in space,
$\sigma^2$ is a functional of the Brownian motion as in Eq. (25) with $U(x) = x^2$.

3. In finance, a typical stock price $S(\tau)$ is sometimes modelled by the exponential of a Brownian motion, $S(\tau) =
e^{-\beta x(\tau)}$, where $\beta$ is a constant. An object that often plays a crucial role is the integrated stock price up to some
'target' time $t$, i.e. $T = \int_0^t e^{-\beta x(\tau)}\, d\tau$ [26]. Thus in this problem $U(x) = e^{-\beta x}$. This integrated
stock price has an interesting analogy in a disordered system where a single overdamped particle moves in a
random potential. A popular model is the so called Sinai model [27] where the random potential is modelled as
the trace of a random walker in space. Interpreting the time $\tau$ as the spatial distance, $x(\tau)$ is then the potential
energy of the particle and $e^{-\beta x(\tau)}$ is just the Boltzmann factor. The total time $t$ is just the size of a linear box
in which the particle is moving. Thus $T = \int_0^t e^{-\beta x(\tau)}\, d\tau$ is just the partition function of the particle in a random
potential [28]. In addition, the exponential of a Brownian motion also appears in the expression for the Wigner
time delay in one dimensional quantum scattering process by a random potential [29].

4. In simple models describing the stochastic behavior of daily temperature records, one assumes that the daily
temperature deviation from its average is a simple Brownian motion $x(\tau)$ in a harmonic potential (the Ornstein-Uhlenbeck
process). Then the relevant quantity whose statistical properties are of interest is the so called
'heating degree days' (HDD) defined as $T = \int_0^t x(\tau)\, \theta(x(\tau))\, d\tau$ that measures the integrated excess temperature
up to time $t$ [30]. Thus in this example, the function $U(x) = x\, \theta(x)$.

5. Another quantity, first studied in the context of economics [31] and later extensively by probabilists [32], is the
total area (unsigned) under a Brownian motion, i.e. $T = \int_0^t |x(\tau)|\, d\tau$. Thus in this example, $U(x) = |x|$. The
same functional was also studied by physicists in the context of electron-electron interactions and phase coherence in one
dimensional weakly disordered quantum wires [33].

We will mention several other examples as we go along. Note that in all the examples mentioned above the function
$U(x)$ is such that the random variable $T$ has only positive support. Henceforth we will assume that. For a given such
function $U(x)$, how does one calculate the pdf of $T$? It was Kac who, using the path integral techniques developed
by Feynman in his Ph.D. thesis, first devised a way of computing the pdf $P(T, t|x_0)$ of a Brownian functional that
led to the famous Feynman-Kac formula. We summarize below Kac's formalism.
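Before turning to the analytical machinery, it may help to see what a single realization of $T$ looks like numerically. The sketch below samples one discretized Brownian path with $D = 1/2$ and accumulates a Riemann sum for $T = \int_0^t U(x(\tau))\, d\tau$ for several of the choices of $U(x)$ listed above. The step size, the value $\beta = 1$ and the helper name `brownian_functional` are assumptions made purely for illustration; for simplicity the path is a free Brownian motion in all cases (so the harmonic potential of example 4 is not included).

```python
import math
import random

def brownian_functional(U, t, dt=1e-3, x0=0.0, rng=None):
    """Riemann-sum estimate of T = int_0^t U(x(tau)) dtau along one sampled path.

    The path is a free Brownian motion with D = 1/2, so each increment
    is Gaussian with variance 2*D*dt = dt.
    """
    rng = rng or random.Random()
    n_steps = int(round(t / dt))
    step_std = math.sqrt(dt)
    x, T = x0, 0.0
    for _ in range(n_steps):
        T += U(x) * dt
        x += rng.gauss(0.0, step_std)
    return T

examples = {
    "occupation time, U = theta(x)": lambda x: 1.0 if x > 0 else 0.0,
    "variance type, U = x^2": lambda x: x * x,
    "unsigned area, U = |x|": abs,
    "exponential, U = exp(-beta x) with beta = 1": lambda x: math.exp(-x),
}

rng = random.Random(1)
for name, U in examples.items():
    print(name, "->", brownian_functional(U, t=1.0, rng=rng))
```

Repeating such a sampling many times and histogramming the values gives a Monte Carlo estimate of the pdf $P(T, t|x_0)$, against which the exact results obtained from the Feynman-Kac approach can be compared.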



Feynman-Kac formula: Since $T$ has only positive support, a natural step is to introduce the Laplace transform of
the pdf $P(T, t|x_0)$,

$$Q(x_0, t) = \int_0^{\infty} e^{-pT}\, P(T, t|x_0)\, dT = E_{x_0}\left[e^{-p \int_0^t U(x(\tau))\, d\tau}\right] \tag{26}$$



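where the r.h.s is an expectation over all possible Brownian paths $\{x(\tau)\}$ that start at $x_0$ at $\tau = 0$ and propagate up
to time $\tau = t$. We have, for notational simplicity, suppressed the $p$ dependence of $Q(x_0, t)$. Using the measure of the
Brownian path in Eq. (15), one can then express the expectation on the r.h.s of Eq. (26) as a path integral

$$Q(x_0, t) = E_{x_0}\left[e^{-p \int_0^t U(x(\tau))\, d\tau}\right] = \int_{-\infty}^{\infty} dx \int_{x(0) = x_0}^{x(t) = x} \mathcal{D}x(\tau)\, \exp\left\{-\int_0^t d\tau \left[\frac{1}{2} \left(\frac{dx}{d\tau}\right)^2 + p\, U(x(\tau))\right]\right\} \tag{27}$$

$$= \int_{-\infty}^{\infty} dx\, \langle x|\, e^{-Ht}\, |x_0 \rangle \tag{28}$$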



where the quantum Hamiltonian $H = -\frac{1}{2} \frac{\partial^2}{\partial x^2} + p\, U(x)$ corresponds to the Schrödinger operator with a potential $p\, U(x)$.
Note that in Eq. (27) all paths propagate from $x(0) = x_0$ to $x(t) = x$ in time $t$ and then we have integrated over the
final position $x$. The quantum propagator $G(x, t|x_0) = \langle x|\, e^{-Ht}\, |x_0 \rangle$ satisfies a Schrödinger-like equation

$$\frac{\partial G}{\partial t} = \frac{1}{2} \frac{\partial^2 G}{\partial x^2} - p\, U(x)\, G \tag{29}$$



which can be easily established by differentiating $G(x, t|x_0) = \langle x|\, e^{-Ht}\, |x_0 \rangle$ with respect to $t$ and using the explicit
representation of the operator $H$. The initial condition is simply $G(x, 0|x_0) = \delta(x - x_0)$. Thus the scheme of Kac
involves three steps: (i) solve the partial differential equation (29) to get $G(x, t|x_0)$, (ii) integrate $G(x, t|x_0)$ over the
final position $x$ as in Eq. (28) to obtain the Laplace transform $Q(x_0, t)$ and (iii) invert the Laplace transform in
Eq. (26) to obtain the pdf $P(T, t|x_0)$. The equations (26), (28) and (29) are collectively known as the celebrated
Feynman-Kac formula.



A shorter backward Fokker-Planck approach: An alternative and somewhat shorter approach would be to write
down a partial differential equation for $Q(x_0, t)$ in Eq. (28) directly. An elementary exercise yields

$$\frac{\partial Q}{\partial t} = \frac{1}{2} \frac{\partial^2 Q}{\partial x_0^2} - p\, U(x_0)\, Q \tag{30}$$

where the spatial derivatives are with respect to the initial position $x_0$. This is thus a 'backward' Fokker-Planck
approach as opposed to the 'forward' Fokker-Planck equation satisfied by $G$ in Eq. (29) of Kac where the
spatial derivatives are with respect to the final position of the particle. Basically we have eliminated the additional step
(ii) of integrating over the final position in Kac's derivation. The solution $Q(x_0, t)$ of Eq. (30) must satisfy the initial
condition $Q(x_0, 0) = 1$ that follows directly from the definition in Eq. (26). To solve Eq. (30), it is useful to take a
further Laplace transform of Eq. (30) with respect to $t$, $\tilde{Q}(x_0, \alpha) = \int_0^{\infty} Q(x_0, t)\, e^{-\alpha t}\, dt$. Using the initial condition
$Q(x_0, 0) = 1$, one arrives at an ordinary second order differential equation

$$\frac{1}{2} \frac{d^2 \tilde{Q}}{dx_0^2} - \left[\alpha + p\, U(x_0)\right] \tilde{Q} = -1 \tag{31}$$

which needs to be solved subject to the appropriate boundary conditions that depend on the behavior of the function
$U(x)$ at large $x$. Given that $T = \int_0^t U(x(\tau))\, d\tau$ has positive support, there are two typical representative asymptotic
behaviors of $U(x)$:

1. If the function $U(x)$ approaches a constant value at large $x$, i.e. $U(x) \to c_{\pm}$ as $x \to \pm\infty$, then it is easy to
argue (for an example, see below) that $\tilde{Q}(x_0 \to \pm\infty, \alpha) = 1/[p\, c_{\pm} + \alpha]$. In this case, the underlying quantum
Hamiltonian $H = -\frac{1}{2} \frac{\partial^2}{\partial x^2} + p\, U(x)$ has scattering states in its spectrum, in addition to possible bound states.

2. If the function $U(x) \to \infty$ as $x \to \pm\infty$, then $\tilde{Q}(x_0 \to \pm\infty, \alpha) = 0$. In this case the underlying quantum
Hamiltonian $H$ has only bound states and hence a discrete spectrum.

Thus, in principle, knowing the solution $\tilde{Q}(x_0, \alpha)$ of Eq. (31), the original pdf $P(T, t|x_0)$ can be obtained by inverting
the double Laplace transform

$$\tilde{Q}(x_0, \alpha) = \int_0^{\infty} dt\, e^{-\alpha t} \int_0^{\infty} dT\, e^{-pT}\, P(T, t|x_0). \tag{32}$$

Below we provide an example where all these steps can be carried out explicitly to obtain an exact closed form
expression for the pdf $P(T, t|x_0)$.



A simple illustration: Lévy's arcsine law for the distribution of the occupation time



As an illustration of the method outlined in the previous subsection, let us calculate the distribution of the occupation
time $T = \int_0^t \theta[x(\tau)]\, d\tau$. This distribution was first computed by Lévy using probabilistic methods [21]. Later Kac
derived it using the Feynman-Kac formalism discussed above [9]. We present here a derivation based on the backward
Fokker-Planck approach outlined above.

Substituting $U(x_0) = \theta(x_0)$ in Eq. (31) we solve the differential equation separately for $x_0 > 0$ and $x_0 < 0$ and
then match the solution at $x_0 = 0$ by demanding the continuity of the solution and that of its first derivative. In
addition, we use the boundary conditions $\tilde{Q}(x_0 \to \infty, \alpha) = 1/(\alpha + p)$ and $\tilde{Q}(x_0 \to -\infty, \alpha) = 1/\alpha$. They follow from
the observations:

1. If the starting point $x_0 \to \infty$, the particle will stay on the positive side for all finite $t$, implying $T = \int_0^t \theta(x(\tau))\, d\tau = t$
and hence $Q(x_0 \to \infty, t) = E[e^{-pT}] = e^{-pt}$, whose Laplace transform is $\tilde{Q}(x_0 \to \infty, \alpha) = \int_0^{\infty} e^{-(\alpha + p)t}\, dt =
1/(\alpha + p)$.

2. If the starting point $x_0 \to -\infty$, the particle stays on the negative side up to any finite $t$, implying $T =
\int_0^t \theta(x(\tau))\, d\tau = 0$ and hence $Q(x_0 \to -\infty, t) = E[e^{-pT}] = 1$, whose Laplace transform is $\tilde{Q}(x_0 \to -\infty, \alpha) =
\int_0^{\infty} e^{-\alpha t}\, dt = 1/\alpha$.

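Using these boundary and matching conditions, one obtains an explicit solution

$$\tilde{Q}(x_0,\alpha) = \frac{1}{\alpha+p} + \frac{\sqrt{\alpha+p}-\sqrt{\alpha}}{\sqrt{\alpha}\,(\alpha+p)}\; e^{-\sqrt{2(\alpha+p)}\,x_0} \quad \text{for } x_0 > 0, \tag{33}$$

$$\tilde{Q}(x_0,\alpha) = \frac{1}{\alpha} + \frac{\sqrt{\alpha}-\sqrt{\alpha+p}}{\alpha\,\sqrt{\alpha+p}}\; e^{\sqrt{2\alpha}\,x_0} \quad \text{for } x_0 < 0. \tag{34}$$

The solution is simpler if the particle starts at the origin $x_0 = 0$. Then one gets from above

$$\tilde{Q}(0,\alpha) = \frac{1}{\sqrt{\alpha(\alpha+p)}}. \tag{35}$$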
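The matching construction behind Eqs. (33)-(35) can be checked numerically. The following is a minimal Python sketch, not part of the original derivation: the function names `q_tilde` and `residual` and the finite-difference step `h` are our own illustrative choices. It evaluates the piecewise solution and measures how well it satisfies Eq. (31) with $U(x_0) = \theta(x_0)$ away from the origin.

```python
import math

def q_tilde(x0, alpha, p):
    """Piecewise solution of (1/2) Q'' - [alpha + p*theta(x0)] Q = -1, Eqs. (33)-(34)."""
    ra, rp = math.sqrt(alpha), math.sqrt(alpha + p)
    if x0 > 0:
        amp = (rp - ra) / (ra * (alpha + p))
        return 1.0 / (alpha + p) + amp * math.exp(-math.sqrt(2.0 * (alpha + p)) * x0)
    # x0 <= 0 branch; at x0 = 0 both branches agree
    amp = (ra - rp) / (alpha * rp)
    return 1.0 / alpha + amp * math.exp(math.sqrt(2.0 * alpha) * x0)

def residual(x0, alpha, p, h=1e-3):
    """Finite-difference residual of Eq. (31) at a point x0 with |x0| > h."""
    u = 1.0 if x0 > 0 else 0.0  # U(x0) = theta(x0)
    d2 = (q_tilde(x0 + h, alpha, p) - 2.0 * q_tilde(x0, alpha, p)
          + q_tilde(x0 - h, alpha, p)) / h**2
    return 0.5 * d2 - (alpha + p * u) * q_tilde(x0, alpha, p) + 1.0
```

For any positive `alpha` and `p`, `q_tilde(0.0, alpha, p)` should agree with $1/\sqrt{\alpha(\alpha+p)}$ of Eq. (35) up to rounding, and `residual` should be small on either side of the origin.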



Inverting the Laplace transform, first with respect to $p$ and then with respect to $\alpha$, one obtains the pdf of the occupation time for all $0 \le T \le t$

$$P(T,t|x_0 = 0) = \frac{1}{\pi}\,\frac{1}{\sqrt{T(t-T)}}. \tag{36}$$

In particular, the cumulative distribution

$$\int_0^T P(T', t|x_0 = 0)\,dT' = \frac{2}{\pi}\arcsin\left(\sqrt{\frac{T}{t}}\right) \tag{37}$$

is known as the famous arcsine law of Levy [21].

The result in Eq. (36) is interesting and somewhat counterintuitive. The probability density peaks at the two end points $T = 0$ and $T = t$ and has a minimum at $T = t/2$, which is also the average occupation time. Normally one would expect that any 'typical' path would spend roughly half the time $t/2$ on the positive side and the other half on the negative side. If that were the case, one would have a peak of the occupation time distribution at the average value $t/2$. The actual result is exactly the opposite: one has a minimum at $T = t/2$! This means that a typical path, starting at the origin, tends to stay either entirely on the positive side (explaining the peak at $T = t$) or entirely on the negative side (explaining the peak at $T = 0$). In other words, a typical Brownian path is 'stiff' and reluctant to cross the origin. This property that 'the typical is not the same as the average' is one of the hidden surprises of Einstein's Brownian motion.
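The U-shaped density can be seen directly in a simulation. The sketch below is our own illustration and assumes a simple $\pm 1$ lattice random walk as a stand-in for the Brownian path; the function names, the number of walks and the number of steps are arbitrary choices. A unit step is counted as spent on the positive side when either of its endpoints lies above the origin.

```python
import math
import random

def arcsine_cdf(T, t):
    """Cumulative distribution of Eq. (37)."""
    return (2.0 / math.pi) * math.asin(math.sqrt(T / t))

def occupation_fraction(n_steps, rng):
    """Fraction of steps a +/-1 random walk started at 0 spends on the positive side."""
    x = 0
    positive = 0
    for _ in range(n_steps):
        step = 1 if rng.random() < 0.5 else -1
        # the segment from x to x + step is positive if either endpoint is above 0
        if x > 0 or x + step > 0:
            positive += 1
        x += step
    return positive / n_steps

def sample_fractions(n_walks=2000, n_steps=400, seed=0):
    """Occupation fractions T/t for n_walks independent walks (seeded for repeatability)."""
    rng = random.Random(seed)
    return [occupation_fraction(n_steps, rng) for _ in range(n_walks)]
```

Comparing the fraction of samples below 0.1 or above 0.9 with the fraction falling between 0.45 and 0.55 shows that the extremes carry considerably more weight than the middle, in line with Eq. (36); the empirical fraction below 0.1 can also be compared with `arcsine_cdf(0.1, 1.0)`.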

The concept of the occupation time and related quantities have been studied by probabilists for a long time [34]. Recently they have played important roles in physics as well, for example in understanding the dynamics out of equilibrium in coarsening systems [35], ergodicity properties in anomalously diffusive processes [36], in renewal processes [37], in models related to spin glasses [38], in understanding certain aspects of transport properties in disordered systems [39] and also in simple models of blinking quantum dots [40].



IV. AREA UNDER A BROWNIAN EXCURSION: APPLICATIONS IN PHYSICS AND COMPUTER SCIENCE

In this section we consider an example where, by applying the path integral method outlined in the previous section, one can compute exactly the distribution of a functional of a Brownian process that is also constrained to stay positive over a fixed time interval $[0, t]$. A Brownian motion $x(\tau)$ in an interval $0 \le \tau \le t$, that starts and ends at the origin, $x(0) = x(t) = 0$, but is conditioned to stay positive in between, is called a Brownian excursion. The area under the excursion, $A = \int_0^t x(\tau)\,d\tau$, is clearly a random variable taking a different value for each realization of the excursion. A natural question that mathematicians have studied quite extensively [41, 42, 43, 44, 45] over the past two decades is: what is the pdf $P(A,t)$ of the area under a Brownian excursion over the interval $[0,t]$? Since the typical lateral displacement of the excursion at time $\tau$ scales as $\sqrt{\tau}$, it follows that the area over the interval $[0, t]$ will scale as $t^{3/2}$ and hence its pdf must have a scaling form, $P(A,t) = t^{-3/2} f(A/t^{3/2})$. The normalization condition $\int_0^\infty P(A,t)\,dA = 1$ demands the prefactor $t^{-3/2}$ and also the conditions $f(x) \ge 0$ for all $x$ and $\int_0^\infty f(x)\,dx = 1$. One then interprets the scaling function $f(x)$ as the distribution of the area under the Brownian excursion $x(u)$ over a unit interval $u \in [0, 1]$. The function $f(x)$, or rather its Laplace transform, was first computed analytically by Darling [41] and independently by Louchard [42],

$$\tilde{f}(s) = \int_0^\infty f(x)\,e^{-sx}\,dx = s\sqrt{2\pi}\,\sum_{k=1}^{\infty} e^{-\alpha_k s^{2/3}\, 2^{-1/3}}, \tag{38}$$

where the $\alpha_k$'s are the magnitudes of the zeros of the standard Airy function $\mathrm{Ai}(z)$. The Airy function $\mathrm{Ai}(z)$ has discrete zeros on the negative real axis at e.g. $z = -2.3381$, $z = -4.0879$, $z = -5.5205$ etc. Thus, $\alpha_1 = 2.3381\ldots$, $\alpha_2 = 4.0879\ldots$, $\alpha_3 = 5.5205\ldots$ etc. Since the expression of $f(x)$ involves the zeros of the Airy function, the function $f(x)$ has been named the Airy distribution function [44], which should not be confused with the Airy function $\mathrm{Ai}(x)$ itself. Even though Eq. (38) provides a formally exact expression of the Laplace transform, it turns out that the calculation of the moments $M_n = \int_0^\infty x^n f(x)\,dx$ is highly nontrivial and they can be determined only recursively (see Section II). Takacs was able to formally invert the Laplace transform in Eq. (38) to obtain [43]

$$f(x) = \frac{2\sqrt{6}}{x^{10/3}}\,\sum_{k=1}^{\infty} e^{-b_k/x^2}\, b_k^{2/3}\, U(-5/6,\, 4/3,\, b_k/x^2), \tag{39}$$

where $b_k = 2\alpha_k^3/27$ and $U(a,b,z)$ is the confluent hypergeometric function. The function $f(x)$ has the following asymptotic tails [43],

$$f(x) \sim x^{-5}\, e^{-2\alpha_1^3/27x^2} \quad \text{as } x \to 0, \qquad f(x) \sim e^{-6x^2} \quad \text{as } x \to \infty. \tag{40}$$
FIG. 1: A Brownian excursion over the time interval $0 \le \tau \le t$ starting at $x(0) = \epsilon$ and ending at $x(t) = \epsilon$ and staying positive in between.



So, why would anyone care about such a complicated function? The reason behind the sustained interest and study [43] of this function $f(x)$ seems to be the fact that it keeps resurfacing in a number of seemingly unrelated problems, in computer science, graph theory, two dimensional growth problems and more recently in fluctuating one dimensional interfaces. We discuss below some of these applications.

The result in Eq. (38) was originally derived using probabilistic methods [41, 42]. A more direct physical derivation using the path integral method was provided more recently [49], which we outline below. Following the discussion in the previous section, our interest here is in the functional $T = A = \int_0^t x(\tau)\,d\tau$. However, we also need to impose the constraint that the path stays positive between $0$ and $t$, i.e. we have to insert a factor $\prod_\tau \theta[x(\tau)]$ in the path integral. One needs to be a bit careful in implementing this constraint. Note that the path starts at the origin, i.e. $x(0) = 0$. But if we take a continuous time Brownian path that starts at the origin, it immediately recrosses the origin many times and hence it is impossible to restrict a Brownian path to be positive over an interval if it starts at the origin. One can circumvent this problem by introducing a small cut-off $\epsilon$, i.e. we consider all paths that start at $x(0) = \epsilon$, end at $x(t) = \epsilon$ and stay positive in between (see Fig. 1). We then first derive the pdf $P(A, t, \epsilon)$ and eventually take the limit $\epsilon \to 0$.

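Following the method in the previous section, the Laplace transform of the pdf is now given by

$$Q(\epsilon,t) = E\left[e^{-p\int_0^t x(\tau)\,d\tau}\right] = \frac{1}{Z_\epsilon}\int_{x(0)=\epsilon}^{x(t)=\epsilon} \mathcal{D}x(\tau)\; e^{-\int_0^t d\tau\left[\frac{1}{2}(dx/d\tau)^2 + p\,x(\tau)\right]}\;\prod_{\tau=0}^{t}\theta[x(\tau)] \tag{41}$$

where $Z_\epsilon$ is a normalization constant

$$Z_\epsilon = \int_{x(0)=\epsilon}^{x(t)=\epsilon} \mathcal{D}x(\tau)\; e^{-\frac{1}{2}\int_0^t d\tau\,(dx/d\tau)^2}\;\prod_{\tau=0}^{t}\theta[x(\tau)] \tag{42}$$

that is just the partition function of the Brownian excursion.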

Clearly, $Z_\epsilon = \langle \epsilon|e^{-\hat{H}_1 t}|\epsilon\rangle$ where $\hat{H}_1 = -\frac{1}{2}\frac{d^2}{dx^2} + V(x)$, with the potential $V(x) = 0$ for $x > 0$ and $V(x) = \infty$ for $x \le 0$. We have already evaluated this in Section III-A in Eq. (23). Putting $x = x_0 = \epsilon$ in Eq. (23) we get $Z_\epsilon = G(\epsilon, t|\epsilon, 0) = (1 - e^{-2\epsilon^2/t})/\sqrt{2\pi t}$. The path integral in the numerator in Eq. (41) is simply the propagator $\langle \epsilon|e^{-\hat{H} t}|\epsilon\rangle$ where the Hamiltonian $\hat{H} = -\frac{1}{2}\frac{d^2}{dx^2} + p\,U(x)$ has a triangular potential $U(x) = x$ for $x > 0$ and $U(x) = \infty$ for $x \le 0$. The Hamiltonian $\hat{H}$ has only bound states and discrete eigenvalues. Its eigenfunctions are simply shifted Airy functions and its eigenvalues are given, up to a scale factor, by the negative of the zeros of the Airy function: $E_k = 2^{-1/3}\alpha_k\, p^{2/3}$. Expanding the propagator into its eigenbasis and finally taking the $\epsilon \to 0$ limit (for details see Ref. [49]), one derives the result

$$Q(0,t) = \int_0^\infty P(A,t)\,e^{-pA}\,dA = \sqrt{2\pi}\,(p\,t^{3/2})\,\sum_{k=1}^{\infty} e^{-2^{-1/3}\alpha_k\,(p\,t^{3/2})^{2/3}} \tag{43}$$

where the $\alpha_k$'s are the negative of the zeros of the Airy function. The result in Eq. (43) indicates that its inverse Laplace transform has the scaling form $P(A,t) = t^{-3/2} f(A\,t^{-3/2})$, where the Laplace transform of the scaling function $f(x)$ is given in Eq. (38).

Applications of the Airy Distribution Function: The Airy distribution function in Eq. (39) has appeared in a number of applications ranging from computer science and graph theory to physics. Below we mention some of these applications.

1. Cost function in data storage: One of the simplest algorithms for data storage in a linear table is called the linear probing with hashing (LPH) algorithm. It was originally introduced by D. Knuth [50] and has been the object of intense study in computer science due to its simplicity, efficiency and general applicability [44]. Recently it was shown [51] that the LPH algorithm gives rise to a correlated drop-push percolation model in one dimension that belongs to a different universality class compared to the ordinary site percolation model. Knuth, a pioneer in the analysis of algorithms, has indicated that this problem has had a strong influence on his scientific career [44]. The LPH algorithm is described as follows: Consider $M$ items $x_1, x_2, \ldots, x_M$ to be placed sequentially into a linear table with $L$ cells labelled $1, 2, \ldots, L$ where $L \ge M$. Initially all cells are empty and each cell can contain at most one item. For each item $x_i$, a hash address $h_i \in \{1, 2, \ldots, L\}$ is assigned, i.e. the label $h_i$ denotes the address of the cell to which $x_i$ should go. Usually the hash address $h_i$ is chosen randomly from the set $\{1, 2, \ldots, L\}$. The item $x_i$ is inserted at its hash address $h_i$ provided the cell labelled $h_i$ is empty. If it is already occupied, one tries cells $h_i + 1$, $h_i + 2$, etc. until an empty cell is found (the locations of the cells are interpreted modulo $L$) where the item $x_i$ is finally inserted. In the language of statistical physics, this is like a drop-push model. One starts with an empty periodic lattice. A site is chosen at random and one attempts to drop a particle there. If the target site is empty, the incoming particle occupies it and one starts the process with a new particle. If the target site is occupied, then the particle keeps hopping to the right until it finds an empty site which it then occupies and then one starts with a new particle and so on.
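The insertion rule described above can be written as a few lines of Python. This is a minimal sketch rather than a reference implementation: the function name `lph_fill` is hypothetical, and we assume 0-based cell labels (the text uses labels $1, \ldots, L$) so that the wrap-around is a plain modulo operation. The function also counts the unsuccessful probes, i.e. the number of times an item finds a cell already occupied.

```python
def lph_fill(hashes, L):
    """Insert items with the given 0-based hash addresses into a table of L cells
    by linear probing. Returns (table, cost): table[c] is the index of the item
    stored in cell c (or None), cost is the number of unsuccessful probes."""
    if len(hashes) > L:
        raise ValueError("more items than cells")
    table = [None] * L
    cost = 0
    for item, h in enumerate(hashes):
        cell = h % L
        while table[cell] is not None:  # occupied cell: an unsuccessful probe
            cost += 1
            cell = (cell + 1) % L       # try the next cell, modulo L
        table[cell] = item
    return table, cost
```

For instance, three items that all hash to cell 0 of a four-cell table end up in cells 0, 1 and 2, after 0, 1 and 2 unsuccessful probes respectively.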

From the computer science point of view, the object of interest is the cost function $C(M, L)$ defined as the total number of unsuccessful probes encountered in inserting the $M$ items into a table of size $L$. In particular, the total cost $C = C(L, L)$ in filling up the table is an important measure of the efficiency of the algorithm. The cost $C$ is clearly a random variable, i.e. it has a different value for different histories of filling up the table. A central question is: what is its pdf $P(C, L)$? It has been shown rigorously by combinatorial methods [44] that $P(C, L)$ has a scaling form for large $L$, $P(C, L) \simeq L^{-3/2} f(C L^{-3/2})$, where the scaling function $f(x)$ is precisely the Airy distribution function in Eq. (39) that describes the distribution of the area under a Brownian excursion. To understand the connection between the two problems, consider any given history of the process where the table, starting initially with all sites empty, gets eventually filled up. We define a stochastic quantity $X_i$ that measures the total number of attempts at site $i$ till the end of the process in any given history. Clearly $X_i \ge 1$ and out of $X_i$ attempts at site $i$, only one of the attempts (the first one) has been successful in filling up the site, while the remaining $(X_i - 1)$ have been unsuccessful. Thus, the total cost is $C = \sum_{i=1}^{L} (X_i - 1)$. Now, the site $(i-1)$ has been attempted $X_{i-1}$ times, out of which only the first one was successful and the remaining $(X_{i-1} - 1)$ attempts resulted in pushing the particle to the right neighbour $i$, and thus each of these unsuccessful attempts at $(i-1)$ results in an attempt at site $i$. Thus, one can write a recursion relation

$$X_i = X_{i-1} - 1 + \xi_i \tag{44}$$

where $\xi_i$ is a random variable that counts the number of direct attempts (not coming from site $(i-1)$) at site $i$. Thus $\mathrm{Prob}(\xi_i = k) = \mathrm{Prob}$(the site $i$ is chosen for a direct hit $k$ times out of a total of $L$ trials) $= \binom{L}{k}(1/L)^k (1 - 1/L)^{L-k}$, since for random hashing, the probability that site $i$ is chosen, out of $L$ sites, is simply $1/L$. Clearly the noise $\xi$ has a mean value $\langle \xi \rangle = 1$. If we now define $x_i = X_i - 1$, then the $x_i$'s satisfy

$$x_i = x_{i-1} + \eta_i \tag{45}$$

where $\eta_i = \xi_i - 1$ is a noise, independent from site to site, and for each site $i$, it is chosen from a binomial distribution. Note that $\langle \eta_i \rangle = \langle \xi_i \rangle - 1 = 0$. Thus, the $x_i$'s clearly represent a random walk in space from $0$ to $L$ with periodic boundary conditions. Moreover, since $X_i \ge 1$, we have $x_i \ge 0$, indicating that it is a discrete version of a Brownian excursion, and the total cost $C = \sum_{i=1}^{L}(X_i - 1) = \sum_{i=1}^{L} x_i$ is just the area under the Brownian excursion. For a large number of steps $L$, the discrete and the continuum versions share the same probability distribution, which shows that the probability distribution of the total cost in the LPH algorithm is precisely the same as that of the area under a Brownian excursion.

FIG. 2: The LPH algorithm for a table of 10 sites. The figure shows an incoming item which randomly chose site 1 to drop, but since site 1 is already occupied, the incoming item keeps hopping to the right until it finds an empty cell at location 4, by which it gets absorbed.
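The bookkeeping behind Eqs. (44) and (45) can be verified on a simulated history. The sketch below is our own illustration, with hypothetical names and 0-based cell labels: it fills a table completely ($M = L$) with random hash addresses and records, for every cell, the total number of attempts $X_i$ and the number of direct hits $\xi_i$.

```python
import random

def fill_and_count(L, seed=0):
    """Fill a periodic table of L cells completely by linear probing.
    Returns (attempts, direct, cost) where attempts[i] plays the role of X_i,
    direct[i] the role of xi_i, and cost is the number of unsuccessful probes."""
    rng = random.Random(seed)
    occupied = [False] * L
    attempts = [0] * L
    direct = [0] * L
    cost = 0
    for _ in range(L):                  # M = L items
        cell = rng.randrange(L)         # random hash address
        direct[cell] += 1
        attempts[cell] += 1
        while occupied[cell]:           # unsuccessful attempt: push to the right
            cost += 1
            cell = (cell + 1) % L
            attempts[cell] += 1
        occupied[cell] = True
    return attempts, direct, cost
```

For any seed, the returned lists should satisfy `cost == sum(a - 1 for a in attempts)` and the periodic recursion `attempts[i] == attempts[i - 1] - 1 + direct[i]`, so that `[a - 1 for a in attempts]` is a non-negative walk on a ring whose area is the total cost.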

2. Internal path lengths on rooted planar trees: Rooted planar trees are important combinatorial objects in graph theory and computer science [52]. Examples of rooted planar trees with $n + 1 = 4$ vertices are shown in Fig. 3. There are in general $C_{n+1} = \frac{1}{n+1}\binom{2n}{n}$ possible rooted planar tree configurations with $(n+1)$ vertices. For example, $C_1 = 1$, $C_2 = 1$, $C_3 = 2$, $C_4 = 5$, $C_5 = 14$ etc.; these are the Catalan numbers. An important quantity of interest is the total internal path length $d$ of a tree, which is simply the sum of the distances of all the $n$ non-root vertices from the root, $d = \sum_{i=1}^{n} d_i$, $d_i$ being the distance of the $i$-th vertex from the root. Each tree configuration has a particular value of $d$, e.g. in Fig. 3 the 5 different configurations have values $d = 6$, $d = 4$, $d = 4$, $d = 5$ and $d = 3$ respectively. Suppose that all $C_{n+1}$ configurations of trees for a fixed $n$ are sampled with equal probability: what is the probability density $P(d, n)$ of the internal path length $d$? This problem again can be mapped [43] to the problem of the area under a Brownian excursion as shown in Fig. 3. Starting from the root of a planar tree with $(n+1)$ vertices, suppose one traverses the vertices of a tree as shown by the arrows in Fig. 3, ending at the root. We think of this route as the path of a random walker in one dimension. For each arrow pointing away from the root on the tree, we draw a step of the random walker with an upward slope. Similarly, for each arrow pointing to the root on the tree, we draw a step of the random walker with a downward slope. Since on the tree one comes back to the root, it is evident by construction that the corresponding configuration of the random walker $x_m$ is an excursion (i.e. it never goes to the negative side of the origin) that starts at the origin and ends up at the origin after $2n$ steps, $x_0 = 0$ and $x_{2n} = 0$. Such excursions of a discrete random walk are called Dyck paths. Now, the total internal path length $d$ of any tree configuration is simply related to the total 'area' under a Dyck path via $2d = \sum_{m=1}^{2n} x_m + n$, as can be easily verified. Now, for large $n$, Dyck paths essentially become Brownian excursions and the object $\sum_{m=1}^{2n} x_m$ is simply the area $A_{2n}$ under a Brownian excursion over the time interval $[0, 2n]$. Since $A_{2n} \sim (2n)^{3/2}$ for large $n$, it follows that $d \approx A_{2n}/2$. Therefore, its probability density $P(d,n)$ has a scaling form, $P(d,n) = \frac{1}{\sqrt{2}\,n^{3/2}}\, f\!\left(d/\sqrt{2}\,n^{3/2}\right)$, where $f(x)$ is precisely the Airy distribution function in Eq. (39).

FIG. 3: The 5 possible rooted planar trees with (3 + 1) vertices. Each configuration has an associated total internal path length $d$ listed below the configuration. Any given tree configuration, say the second one in the figure, has a one to one correspondence to a Dyck path, i.e. a configuration of a Brownian excursion (discrete time random walk).
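The relation between the internal path length and the area under the Dyck path can be checked by brute-force enumeration for small $n$. The following Python sketch is our own illustration (the function names are hypothetical). It relies on the traversal described above: every up-step of the walk creates a new vertex whose distance from the root equals the height reached by that step.

```python
from math import comb

def dyck_paths(n):
    """Yield every Dyck path with 2n steps as a tuple of +1/-1 steps."""
    def extend(path, height, ups):
        if len(path) == 2 * n:
            yield tuple(path)
            return
        if ups < n:                      # an up-step is still available
            yield from extend(path + [1], height + 1, ups + 1)
        if height > 0:                   # a down-step must not cross the origin
            yield from extend(path + [-1], height - 1, ups)
    yield from extend([], 0, 0)

def path_length_and_area(path):
    """Return (d, area): d sums the depth of the vertex created by each up-step,
    area sums the walker's height x_m over m = 1, ..., 2n."""
    height = d = area = 0
    for step in path:
        height += step
        if step == 1:
            d += height
        area += height
    return d, area
```

For $n = 3$ the enumeration produces five paths, matching the five trees of Fig. 3, and every path satisfies `2 * d == area + n`.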

3. Maximal relative height distribution for fluctuating interfaces: Fluctuating interfaces have been widely studied over the last two decades as they appear in a variety of physical systems such as growing crystals, molecular beam epitaxy, fluctuating steps on metals and growing bacterial colonies [53]. The most well studied model of a fluctuating (1 + 1)-dimensional surface is the so called Kardar-Parisi-Zhang (KPZ) equation that describes the time evolution of the height $H(x, t)$ of an interface growing over a linear substrate of size $L$ via the stochastic partial differential equation

$$\frac{\partial H(x,t)}{\partial t} = \frac{\partial^2 H(x,t)}{\partial x^2} + \lambda\left(\frac{\partial H(x,t)}{\partial x}\right)^2 + \eta(x,t), \tag{46}$$

where $\eta(x,t)$ is a Gaussian white noise with zero mean and a correlator $\langle \eta(x,t)\eta(x',t') \rangle = 2\delta(x - x')\delta(t - t')$. If the parameter $\lambda = 0$, the equation becomes linear and is known as the Edwards-Wilkinson equation. We consider the general case when $\lambda \ge 0$. The height is usually measured relative to the spatially averaged height, i.e. $h(x, t) = H(x, t) - \int_0^L H(x', t)\,dx'/L$. The joint probability distribution of the relative height field $P(\{h\}, t)$ becomes time-independent as $t \to \infty$ in a finite system of size $L$. An important quantity that has attracted some interest recently [53] is the pdf of the maximal relative height (MRH) in the stationary state, i.e. $P(h_m, L)$ where

$$h_m = \lim_{t \to \infty} \max_x\left[\{h(x,t)\},\; 0 \le x \le L\right]. \tag{47}$$



This is an important physical quantity that measures the extreme fluctuations of the interface heights. Note that in this system the height variables are strongly correlated in the stationary state. While the theory of extremes of a set of uncorrelated (or weakly correlated) random variables is well established, not much is known about the distribution of extremes of a set of strongly correlated random variables. Analytical results for such strongly correlated variables would thus be welcome from the general theoretical perspective and the system of fluctuating interfaces provides exactly the opportunity to study the extreme distribution analytically in a strongly correlated system. This problem of finding the MRH distribution was recently mapped [49, 54] again to the problem of the area under a Brownian excursion using the path integral method outlined in Section III and it was shown that for periodic boundary conditions, $P(h_m, L) = L^{-1/2} f(h_m/\sqrt{L})$ where $f(x)$ is again the Airy distribution function in Eq. (39). Interestingly, the distribution does not depend explicitly on $\lambda$. This is thus one of the rare examples where one can calculate analytically the distribution of the extreme of a set of strongly correlated random variables.

FIG. 4: A Brownian path $x(\tau)$, starting at $x_0$ at $\tau = 0$, crosses the origin for the first time at $\tau = t_f$, $t_f$ being the first-passage time.

4. Other applications: Apart from the three examples mentioned above, the Airy distribution function and its moments also appear in a number of other problems. For example, the generating function for the number of inversions in trees involves the Airy distribution function $f(x)$ [53]. Also, the moments $M_n$ of the function $f(x)$ appear in the enumeration of the connected components in a random graph [58]. Recently, it has been conjectured and subsequently tested numerically that the asymptotic pdf of the area of two dimensional self-avoiding polygons is also given by the Airy distribution function $f(x)$ [56]. Besides, numerical evidence suggests that the area enclosed by the outer boundary of planar random loops is also distributed according to the Airy distribution function $f(x)$ [59].



V. FIRST-PASSAGE BROWNIAN FUNCTIONAL

So far we have studied the pdf of a Brownian functional over a fixed time interval $[0, t]$. In this section, we show how to compute the pdf of a Brownian functional over the time interval $[0, t_f]$ where $t_f$ is the first-passage time of the process, i.e. $t_f$ itself is random. More precisely, we consider a functional of the type

$$T = \int_0^{t_f} U(x(\tau))\,d\tau \tag{48}$$

where $x(\tau)$ is a Brownian path starting from $x_0 \ge 0$ at $\tau = 0$ and propagating up to time $\tau = t_f$ and $U(x)$, as before, is some specified function. The integral in Eq. (48) is up to the first-passage time $t_f$ which itself is random in the sense that it varies from realization to realization of the Brownian path (see Fig. 4). Such functionals appear in many problems (some examples are given below) in physics, astronomy, queuing theory etc. and we will generally refer to them as first-passage Brownian functionals.
We would like to compute the pdf $P(T|x_0)$ of $T$ in Eq. (48) given that the Brownian path starts at the initial position $x_0$. As before, it is useful to consider the Laplace transform

$$Q(x_0) = \int_0^\infty e^{-pT} P(T|x_0)\,dT = \left\langle e^{-p\int_0^{t_f} U(x(\tau))\,d\tau} \right\rangle \tag{49}$$

where the rhs is an average over all possible Brownian paths starting at $x_0$ at $\tau = 0$ and stopping at the first time they cross the origin. For brevity, we have suppressed the $p$ dependence of $Q(x_0)$. Note that each path, starting from $x_0$, evolves via the Langevin equation $dx/d\tau = \xi(\tau)$, where $\xi(\tau)$ is a delta correlated white noise. Note also that $t_f$ varies from path to path. Thus at first sight, this seems to be a rather difficult problem to solve. However, as we will now see, this problem is in fact simpler than the previous problem over a fixed time interval $[0, t]$!

To proceed, we split a typical path over the interval $[0, t_f]$ into two parts: a left interval $[0, \Delta\tau]$ where the process proceeds from $x_0$ to $x_0 + \Delta x = x_0 + \xi(0)\Delta\tau$ in a small time $\Delta\tau$ and a right interval $[\Delta\tau, t_f]$ in which the process starts at $x_0 + \Delta x$ at time $\Delta\tau$ and reaches $0$ at time $t_f$. The integral $\int_0^{t_f} U(x(\tau))\,d\tau$ is also split into two parts: $\int_0^{t_f} = \int_0^{\Delta\tau} + \int_{\Delta\tau}^{t_f}$. Since the initial value is $x_0$, one gets $\int_0^{\Delta\tau} U(x(\tau))\,d\tau = U(x_0)\Delta\tau$ for small $\Delta\tau$. Then Eq. (49) can be written as

$$Q(x_0) = \left\langle e^{-p\int_0^{t_f} U(x(\tau))\,d\tau} \right\rangle = \left\langle e^{-pU(x_0)\Delta\tau}\, Q(x_0 + \Delta x) \right\rangle_{\Delta x}, \tag{50}$$

where we have used the fact that for the right interval $[\Delta\tau, t_f]$, the starting position is $x_0 + \Delta x = x_0 + \xi(0)\Delta\tau$, which itself is random. The average in the last expression of Eq. (50) is over all possible realizations of $\Delta x$. We then substitute $\Delta x = \xi(0)\Delta\tau$ in Eq. (50), expand in powers of $\Delta\tau$ and average over the noise $\xi(0)$. We use the fact that the noise is delta correlated, i.e. $\langle \xi^2(0) \rangle = 1/\Delta\tau$ as $\Delta\tau \to 0$. The leading order term on the right hand side of Eq. (50) is independent of $\Delta\tau$ and is simply $Q(x_0)$, which cancels the same term on the left hand side of Eq. (50). Collecting the rest of the terms we get

$$\left[\frac{1}{2}\frac{d^2 Q}{dx_0^2} - p\,U(x_0)\,Q(x_0)\right]\Delta\tau + O\left((\Delta\tau)^2\right) = 0. \tag{51}$$
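The expansion leading to Eq. (51) can be spelled out term by term. The lines below are our own worked version of that step; they assume only $\langle \xi(0) \rangle = 0$ and $\langle \xi^2(0) \rangle = 1/\Delta\tau$, as stated in the text, and primes denote derivatives with respect to $x_0$.

```latex
\begin{aligned}
e^{-pU(x_0)\Delta\tau} &= 1 - p\,U(x_0)\,\Delta\tau + O\big((\Delta\tau)^2\big), \\
Q(x_0+\Delta x) &= Q(x_0) + Q'(x_0)\,\Delta x + \tfrac{1}{2}\,Q''(x_0)\,(\Delta x)^2 + \dots, \\
\langle \Delta x \rangle &= \langle \xi(0) \rangle\,\Delta\tau = 0, \qquad
\langle (\Delta x)^2 \rangle = \langle \xi^2(0) \rangle\,(\Delta\tau)^2 = \Delta\tau, \\
\big\langle e^{-pU(x_0)\Delta\tau}\,Q(x_0+\Delta x) \big\rangle_{\Delta x}
 &= Q(x_0) + \Big[\tfrac{1}{2}\,Q''(x_0) - p\,U(x_0)\,Q(x_0)\Big]\Delta\tau + O\big((\Delta\tau)^2\big).
\end{aligned}
```

Subtracting $Q(x_0)$ from both sides of Eq. (50) then leaves exactly the bracket multiplying $\Delta\tau$.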



Equating the leading order term to zero provides us an ordinary differential equation

$$\frac{1}{2}\frac{d^2 Q}{dx_0^2} - p\,U(x_0)\,Q(x_0) = 0 \tag{52}$$

which is valid in $x_0 \in [0, \infty)$ with the following boundary conditions: (i) When the initial position $x_0 \to 0$, the first-passage time $t_f$ must also be $0$. Hence the integral $\int_0^{t_f} U(x(\tau))\,d\tau = 0$. From the definition in Eq. (50), we get $Q(x_0 = 0) = 1$, and (ii) when the initial position $x_0 \to \infty$, the first-passage time $t_f \to \infty$, hence the integral $\int_0^{t_f} U(x(\tau))\,d\tau$ also diverges in this limit, at least when $U(x)$ is a nondecreasing function of $x$. The definition in Eq. (50) then gives the boundary condition $Q(x_0 \to \infty) = 0$.

So, given a functional $U(x)$, the scheme would be to first solve the ordinary differential equation (52) with the appropriate boundary conditions mentioned above to obtain $Q(x_0)$ explicitly and then invert the Laplace transform in Eq. (49) to get the desired pdf $P(T|x_0)$ of the first-passage functional. As a simple test of this method, let us first consider the case $U(x) = 1$. In this case the functional $T = \int_0^{t_f} U(x(\tau))\,d\tau = t_f$ is the first-passage time itself. The differential equation (52) can be trivially solved and the solution satisfying the given boundary conditions is simply

$$Q(x_0) = e^{-\sqrt{2p}\,x_0}. \tag{53}$$

Inverting the Laplace transform with respect to $p$ gives the pdf of the first-passage time

$$P(t_f|x_0) = \frac{x_0}{\sqrt{2\pi}}\,\frac{e^{-x_0^2/2t_f}}{t_f^{3/2}} \tag{54}$$

which is identical to the result in Eq. (24) obtained by the path integral method. Below we provide a few nontrivial examples and applications of this method.
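The consistency of Eqs. (53) and (54) can be confirmed numerically without any special functions. The sketch below is a minimal illustration under our own assumptions: the function names are hypothetical, and the integral is evaluated by the trapezoidal rule in the variable $s = \ln t$, a choice we make so that both the small-$t$ and the large-$t$ tails are resolved on a uniform grid.

```python
import math

def first_passage_pdf(t, x0):
    """Eq. (54): pdf of the first-passage time to the origin, starting at x0 > 0."""
    return x0 / math.sqrt(2.0 * math.pi * t**3) * math.exp(-x0 * x0 / (2.0 * t))

def laplace_transform(pdf, p, s_min=-40.0, s_max=40.0, n=8000):
    """Trapezoidal estimate of the integral of exp(-p*t) * pdf(t) over t > 0,
    computed in the variable s = ln(t)."""
    ds = (s_max - s_min) / n
    total = 0.0
    for k in range(n + 1):
        t = math.exp(s_min + k * ds)
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * math.exp(-p * t) * pdf(t) * t  # dt = t ds
    return total * ds
```

Calling `laplace_transform(lambda t: first_passage_pdf(t, x0), p)` should reproduce `math.exp(-math.sqrt(2 * p) * x0)`, and setting `p = 0` checks that the pdf is normalized.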



A. Area till the first-passage time 

Here we calculate the pdf of the area under a Brownian motion (starting at $x_0$) till its first-passage time $t_f$. Thus the relevant functional is $A = \int_0^{t_f} x(\tau)\,d\tau$ and hence $U(x) = x$. In Fig. 4, $A$ is just the area under the curve over the time interval $[0, t_f]$. This problem has many applications in combinatorics and queuing theory. For example, an important object in combinatorics is the area of a lattice polygon in two dimensions [61]. A particular example of a lattice polygon is the rooted staircase polygon whose two arms can be thought of as two independent random walkers whose trajectories meet for the first time at the end of the polygon. The difference walk between these two arms then defines, in the continuum limit, a Brownian motion. The area of such a polygon can then be approximated, in the continuum limit, by the area under a single Brownian motion till its first-passage time. This picture also relates this problem to the directed Abelian sandpile model where $t_f$ is just the avalanche duration and the area $A$ is the size of an avalanche cluster. Another application arises in queueing theory, where the length of a queue $l_n$ after $n$ time steps evolves stochastically [61]. In the simplest approximation, one considers a random walk model, $l_n = l_{n-1} + \xi_n$, where the $\xi_n$'s are independent and identically distributed random variables which model the arrival and departure of new customers. When the two rates are equal, $\langle \xi_n \rangle = 0$. In the large $n$ limit, $l_n$ can be approximated by a Brownian motion $x(\tau)$, whereupon $t_f$ becomes the so called 'busy' period (i.e. the time until the queue first becomes empty) and the area $A$ then approximates the total number of customers served during the busy period.

Substituting $U(x) = x$ in Eq. (52), one can solve the differential equation with the prescribed boundary conditions and the solution is [60]

$$Q(x_0) = 3^{2/3}\,\Gamma(2/3)\,\mathrm{Ai}\left(2^{1/3} p^{1/3} x_0\right) \tag{55}$$

where $\mathrm{Ai}(z)$ is the Airy function. It turns out that this Laplace transform can be inverted to give an explicit expression for the pdf

$$P(A|x_0) = \frac{2^{1/3}}{3^{2/3}\,\Gamma(1/3)}\;\frac{x_0}{A^{4/3}}\;\exp\left[-\frac{2x_0^3}{9A}\right]. \tag{56}$$

Thus the pdf has a power law tail for large $A \gg x_0^3$, $P(A|x_0) \sim A^{-4/3}$, and an essential singularity $P(A|x_0) \sim \exp[-2x_0^3/9A]$ for small $A \to 0$. Following the same techniques, one can also derive the pdf of the area till the first-passage time under a Brownian motion with a drift towards the origin. In this case the pdf has a stretched exponential tail for large $A$, $P(A|x_0) \sim A^{-3/4}\exp\left[-\sqrt{8\mu^3 A/3}\right]$, where $\mu$ is the drift.

Note the difference between the pdf of the area $P(A|x_0)$ under a Brownian motion till its first-passage time starting at $x_0$ at $\tau = 0$, as given in Eq. (56), and the pdf of the area under a Brownian excursion $P(A, t)$ in Eq. (43). In the latter case, the Brownian path is conditioned to start at $x_0 = 0$ at $\tau = 0$ and end at $x = 0$ at $\tau = t$ and one is interested in the statistics of the area under such a conditioned path over the fixed time interval $t$. In the former case, on the other hand, one is interested in the area under a free Brownian motion starting at $x_0 > 0$ and propagating up to its first-passage time $t_f$ that is not fixed but varies from one realization of the path to another.
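One way to test Eq. (56) without evaluating Airy functions is to compute its Laplace transform numerically and verify that it obeys Eq. (52) with $U(x) = x$. The sketch below is our own check, with hypothetical function names; the logarithmic integration grid and the finite-difference step `h` are illustrative choices.

```python
import math

def area_pdf(A, x0):
    """Eq. (56): pdf of the area swept until first passage, starting at x0 > 0."""
    c = 2.0 ** (1.0 / 3.0) / (3.0 ** (2.0 / 3.0) * math.gamma(1.0 / 3.0))
    return c * x0 / A ** (4.0 / 3.0) * math.exp(-2.0 * x0**3 / (9.0 * A))

def q_numeric(x0, p, s_min=-40.0, s_max=120.0, n=16000):
    """Trapezoidal estimate of Q(x0) = integral of exp(-p*A) * P(A|x0) dA,
    computed in the variable s = ln(A)."""
    ds = (s_max - s_min) / n
    total = 0.0
    for k in range(n + 1):
        A = math.exp(s_min + k * ds)
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * math.exp(-p * A) * area_pdf(A, x0) * A  # dA = A ds
    return total * ds

def ode_residual(x0, p, h=1e-2):
    """Finite-difference residual of Eq. (52) with U(x) = x: (1/2) Q'' - p*x0*Q."""
    d2 = (q_numeric(x0 + h, p) - 2.0 * q_numeric(x0, p) + q_numeric(x0 - h, p)) / h**2
    return 0.5 * d2 - p * x0 * q_numeric(x0, p)
```

With `p = 0` the function `q_numeric` returns the normalization of the pdf, which should be close to 1, while `ode_residual` should be small for any $x_0 > 0$ and $p > 0$.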



B. Time period of oscillation of an undamped particle in a random potential 

The study of transport properties in a system with quenched disorder is an important area of statistical physics [63]. The presence of quenched disorder makes analytical calculations hard and very few exact results are known. Perhaps the simplest model that captures some complexities associated with the transport properties in disordered systems is that of a classical Newtonian particle moving in a one dimensional random potential $\phi(x)$

$$m\frac{d^2x}{dt^2} + \Gamma\frac{dx}{dt} = F(x(t)) + \xi(t) \tag{57}$$

where $F(x) = -d\phi/dx$ is the force derived from the random potential $\phi(x)$, $\Gamma$ is the friction coefficient and $\xi(t)$ is the thermal noise with zero mean and a delta correlator, $\langle \xi(t)\xi(t') \rangle = 2D\,\delta(t - t')$ with $D = k_B T/\Gamma$ by the Stokes-Einstein relation.

It turns out that even this simple problem is very hard to solve analytically for an arbitrary random potential $\phi(x)$. A special choice of the random potential where one can make some progress is the Sinai potential, where one assumes that $\phi(x) = \int_0^x \eta(x')\,dx'$. The variables $\eta(x)$ have zero mean and are delta correlated, $\langle \eta(x_1)\eta(x_2) \rangle = \delta(x_1 - x_2)$. Thus the potential $\phi(x)$ itself can be considered as a Brownian motion in space. In the overdamped limit when the frictional force is much larger than the inertial force, Eq. (57) then reduces to the Sinai model

$$\Gamma\frac{dx}{dt} = F(x = x(t)) + \xi(t) \tag{58}$$

where the random force $F(x) = -d\phi/dx = \eta(x)$ is just a delta correlated white noise with zero mean: $\langle F(x) \rangle = 0$ and $\langle F(x)F(x') \rangle = \delta(x - x')$.
Here we consider a simple model [64] where the particle moves in the same Sinai potential $\phi(x) = \int_0^x \eta(x')\,dx'$, but we consider the opposite limit where the particle is undamped, i.e. $\Gamma = 0$, and is driven solely by the inertial force. For simplicity, we also consider the zero temperature limit where the thermal noise term drops out of Eq. (57) as well and one simply has

$$m\frac{d^2x}{dt^2} = F(x = x(t)) \tag{59}$$

where $F(x)$ is the same random Sinai force as mentioned above. We set $m = 1$ and assume that the particle starts at the origin $x = 0$ with initial velocity $v > 0$. Thus the particle will move to the right till it reaches a turning point $x_c$ where the potential energy becomes equal to the kinetic energy, i.e. $\phi(x_c) = v^2/2$, and then it will move back to $x = 0$ with a velocity $-v$ (see the accompanying figure). After returning to the origin with velocity $-v$, the particle will go to the left till it encounters the first turning point on the left of the origin where it will turn and then will return to the origin. Let $T$ and $T'$ denote the time for the particle to go from the origin to the turning point at the right and to the one at the left respectively. Thus the particle will oscillate between the two turning points nearest to the origin on either side and the time period of oscillation is $T_{osc} = 2(T + T')$. Note that the variables $T$ and $T'$ will vary from one sample of quenched disorder to another. The goal is to compute the probability distribution of $T$ and $T'$ and hence that of $T_{osc}$. Since $\phi(x)$ is a Brownian motion in $x$, it follows from its Markov property that $\phi(x)$ for $x > 0$ and for $x < 0$ are completely independent of each other. Thus $T$ and $T'$ are also independent and, by symmetry, have identical distributions. The distribution of $T_{osc}$ can then be easily calculated by convolution.

To compute the pdf $P(T)$ of $T$ (starting at $x_0 = 0$), we first express $T$ as a functional of the Brownian potential:

$$T = \int_0^{x_c} \frac{dx}{\sqrt{v^2 - 2\phi(x)}}, \qquad (60)$$

where $x_c$ is defined as the point where $\phi(x_c) = v^2/2$. On identifying space with time, $x \equiv \tau$, and the random
potential $\phi$ with the trajectory of a random walk in space $x$, i.e. $\phi \leftrightarrow x$, $x \leftrightarrow \tau$, $T$ in Eq. (60) is of the general form
in Eq. (48) with $U(x) = 1/\sqrt{v^2 - 2x}$ and $x_c = t_f$ denoting the first-passage time to the level $x = v^2/2$, starting at
$x_0$. Following the general scheme, we need to solve the differential equation, now valid for $-\infty < x_0 < v^2/2$,
with $U(x) = 1/\sqrt{v^2 - 2x}$ and the boundary conditions $Q(x_0 \to -\infty) = 0$ and $Q(x_0 \to v^2/2) = 1$. Upon finding
the solution, one needs to put $x_0 = 0$ and then invert the Laplace transform. This can be done explicitly and one
obtains [64]

$$P(T) = \frac{2^{2/3}\, v^2}{3^{4/3}\, \Gamma(2/3)}\, \frac{1}{T^{5/3}}\, \exp\left[-\frac{2 v^3}{9T}\right]. \qquad (61)$$

This is one of the rare examples of an exact result on a transport property in a quenched disordered system, thus
illustrating the power of the approach outlined in this section.
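
Eq. (61) has the inverse-gamma form $P(T) = \beta^{\nu}\, T^{-\nu-1} e^{-\beta/T}/\Gamma(\nu)$ with $\nu = 2/3$ and $\beta = 2v^3/9$, so that $\beta/T$ is gamma distributed with shape $\nu$ and unit scale. The sketch below uses this observation (an illustrative shortcut, not a construction from Ref. [64]) to sample $T$ and $T'$, and hence $T_{\rm osc} = 2(T + T')$, without carrying out the convolution explicitly. All function names are hypothetical.

```python
import math
import random

NU = 2.0 / 3.0  # exponent in Eq. (61): P(T) ~ T^(-1 - NU) at large T


def beta_of(v):
    """Scale beta = 2 v^3 / 9 appearing in exp(-beta / T)."""
    return 2.0 * v ** 3 / 9.0


def pdf_T(T, v):
    """Eq. (61) written as an inverse-gamma density."""
    b = beta_of(v)
    return b ** NU / math.gamma(NU) * T ** (-NU - 1.0) * math.exp(-b / T)


def sample_T(v, rng=random):
    """Draw T using beta / T ~ Gamma(shape=NU, scale=1)."""
    return beta_of(v) / rng.gammavariate(NU, 1.0)


def sample_T_osc(v, rng=random):
    """T_osc = 2 (T + T'), with T and T' independent and identically distributed."""
    return 2.0 * (sample_T(v, rng) + sample_T(v, rng))
```

A simple consistency check is that the sample mean of $1/T$ should approach $\nu/\beta = 3/v^3$, whereas the mean of $T$ itself diverges because of the $T^{-5/3}$ tail.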



C. Distribution of the Lifetime of a Comet in Solar System 

In this final subsection we provide an example from astrophysics [65] where the general technique of the first-passage
Brownian functional is applicable. A comet enters the solar system with a negative energy $E_0 < 0$ and keeps orbiting
around the sun in an elliptical orbit whose semimajor axis length $a$ is determined by the relation $E_0 = -GM/2a$,
where $G$ is the gravitational constant and $M$ is the mass of the sun. It was first pointed out by Lyttleton that
the energy of the comet gets perturbed by Jupiter each time the comet visits the neighbourhood of the sun and the
planets, and successive perturbations lead to a positive energy of the comet, which then leaves the solar system. It is
convenient to work with the negative energy $x = -E > 0$ of the comet. We assume that the comet enters the solar
system with initial negative energy $x_0$ and has values of $x$ equal to $x_1, x_2, \ldots, x_{t_f}$ at successive orbits until the last one,
labelled by $t_f$, when its value of $x$ crosses $0$ (energy becomes positive) and it leaves the solar system. The lifetime of
the comet is given by

$$T = U(x_0) + U(x_1) + \ldots + U(x_{t_f}) \qquad (62)$$

where $U(x)$ is the time taken to complete an orbit with negative energy $x > 0$. According to Kepler's third law,
$U(x) = c\,x^{-3/2}$, where $c$ is a constant which we set to $c = 1$ for convenience. Moreover, a simple way to describe the
perturbation due to Jupiter is by a random walk model, $x_n = x_{n-1} + \xi_n$, where $\xi_n$ is the noise induced by Jupiter
and is assumed to be independent from orbit to orbit [65]. Within this random walk theory, the lifetime of a comet
in Eq. (62) becomes, in the continuum limit, a first-passage Brownian functional [65]

$$T = \int_0^{t_f} [x(\tau)]^{-3/2}\, d\tau \qquad (63)$$



where the random walk starts at $x_0$ and ends at its first-passage time $t_f$, when it first crosses the origin. The pdf
$P(T|x_0)$ was first obtained by Hammersley. Here we show how to obtain this result using the general approach
outlined here for first-passage Brownian functionals.
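
The discrete random walk model of Eq. (62) is straightforward to simulate, and the following minimal sketch may help fix ideas. It is illustrative only: it assumes Gaussian kicks $\xi_n$ with standard deviation `sigma` (the text only requires them to be independent from orbit to orbit), sets $c = 1$, sums $U(x_n)$ over the orbits on which the comet is still bound ($x_n > 0$), and caps the number of orbits at `max_orbits`. The function name and all parameter values are hypothetical.

```python
import random


def comet_lifetime(x0, sigma=0.1, max_orbits=100_000, rng=random):
    """Return (lifetime, number_of_orbits) for one simulated comet.

    x is the negative energy (x > 0 means the comet is bound). Each bound
    orbit takes a time U(x) = x**(-1.5) (Kepler's third law with c = 1),
    after which x receives an independent Gaussian kick of standard
    deviation sigma (an assumed noise model).
    """
    x = x0
    lifetime = 0.0
    orbits = 0
    while x > 0.0 and orbits < max_orbits:
        lifetime += x ** -1.5  # period of the current bound orbit
        orbits += 1
        x += rng.gauss(0.0, sigma)  # perturbation by Jupiter
    return lifetime, orbits
```

For example, `comet_lifetime(1.0)` returns the total time spent in bound orbits together with the number of orbits completed before the energy first becomes positive (or before the cap is reached).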

Following our general scheme, we thus have $U(x) = x^{-3/2}$ in the differential equation. The solution, satisfying the
proper boundary conditions, can be easily found:

$$Q(x_0) = 16\, p\, x_0^{1/2}\, K_2\!\left(\sqrt{32 p}\; x_0^{1/4}\right) \qquad (64)$$

where $K_2(z)$ is the modified Bessel function of order 2. Next, we need to invert the Laplace transform in Eq. (64)
with respect to $p$. This can be done by using the following identity:

$$\int_0^\infty y^{-\nu-1}\, e^{-p y - \beta/y}\, dy = 2 \left(\frac{p}{\beta}\right)^{\nu/2} K_\nu\!\left(2\sqrt{\beta p}\right). \qquad (65)$$

Choosing $\nu = 2$ and $\beta = 8\sqrt{x_0}$, we can invert the Laplace transform to obtain the exact pdf $P(T|x_0)$ of the lifetime of a comet:



$$P(T|x_0) = \frac{64\, x_0}{T^3}\, \exp\left[-\frac{8\sqrt{x_0}}{T}\right]. \qquad (66)$$
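
The Laplace-transform pair formed by Eqs. (64) and (66) can be checked numerically. The sketch below is an illustrative check, not part of the original derivation. It uses only the Python standard library, evaluates $K_\nu(z)$ from the integral representation $K_\nu(z) = \int_0^\infty e^{-z\cosh t}\cosh(\nu t)\,dt$ with a plain trapezoidal rule, and computes $\int_0^\infty e^{-pT} P(T|x_0)\,dT$ after the substitution $u = 8\sqrt{x_0}/T$. The integration cutoffs and grid sizes are ad hoc choices that are adequate for arguments of order one, and all function names are hypothetical.

```python
import math


def trapezoid(f, a, b, n):
    """Composite trapezoidal rule with n sub-intervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def bessel_k(nu, z):
    """K_nu(z) from int_0^inf exp(-z cosh t) cosh(nu t) dt (cutoff t = 20)."""
    return trapezoid(
        lambda t: math.exp(-z * math.cosh(t)) * math.cosh(nu * t),
        0.0, 20.0, 20000,
    )


def laplace_of_pdf(p, x0):
    """int_0^inf exp(-p T) P(T | x0) dT with P from Eq. (66), using u = beta / T."""
    beta = 8.0 * math.sqrt(x0)

    def integrand(u):
        # In the variable u the integrand is u * exp(-u - p * beta / u),
        # which vanishes as u -> 0.
        if u == 0.0:
            return 0.0
        return u * math.exp(-u - p * beta / u)

    return trapezoid(integrand, 0.0, 60.0, 60000)


def q_closed_form(p, x0):
    """Eq. (64): Q(x0) = 16 p sqrt(x0) K_2(sqrt(32 p) x0^(1/4))."""
    z = math.sqrt(32.0 * p) * x0 ** 0.25
    return 16.0 * p * math.sqrt(x0) * bessel_k(2.0, z)
```

For a given pair $(p, x_0)$, `laplace_of_pdf(p, x0)` and `q_closed_form(p, x0)` should agree to within the quadrature error, and `q_closed_form` should approach 1 as $p \to 0$, which expresses the normalization of $P(T|x_0)$.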



It is worth pointing out that in all three examples above, the pdf $P(T|x_0)$ of the first-passage Brownian functional
has a power-law tail $P(T|x_0) \sim T^{-\gamma}$ for large $T$ (with $\gamma = 5/3$ in Eq. (61) and $\gamma = 3$ in Eq. (66)) and an essential singularity in the limit $T \to 0$. While the
exponent of the power-law tail can be easily obtained using a scaling argument, the essential singular behavior at
small $T$ is not easy to obtain just by a scaling argument.
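
The scaling argument for the tail exponent can be sketched as follows. This is a heuristic outline rather than a derivation taken from the references. It assumes that $U(x) \sim |x|^{\alpha}$ (with $\alpha > -2$) along the long excursions that dominate large $T$, and it uses the standard $t_f^{-3/2}$ decay of the first-passage time density of one-dimensional Brownian motion. The symbols $\alpha$ and $\gamma$ are introduced here for illustration.

```latex
% Typical excursion size during a long first-passage time t_f:
x \sim t_f^{1/2}
\quad\Longrightarrow\quad
T = \int_0^{t_f} U(x(\tau))\, d\tau \sim t_f \, t_f^{\alpha/2} = t_f^{(2+\alpha)/2}.

% Change of variables from t_f to T, with P(t_f) \sim t_f^{-3/2}:
P(T) = P(t_f)\, \frac{dt_f}{dT} \sim T^{-\gamma},
\qquad
\gamma = 1 + \frac{1}{2+\alpha}.

% Checks against the exact results:
% \alpha = -1/2 (oscillation period, Eq. (61)) gives \gamma = 5/3;
% \alpha = -3/2 (comet lifetime, Eq. (66)) gives \gamma = 3.
```

The argument fixes only the exponent $\gamma$. The factor $e^{-\beta/T}$ that controls the small-$T$ behavior in Eqs. (61) and (66) is not captured by it.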

VI. CONCLUSION



In this article I have provided a brief and pedagogical review of the techniques to calculate the statistical properties
of functionals of one-dimensional Brownian motion. It also contains a section devoted to 'first-passage' Brownian
functionals, quantities that appear in many problems but for which the techniques to calculate their properties are somewhat
less known compared to the standard Feynman-Kac formalism for the usual Brownian functional. A simple backward
Fokker-Planck approach is provided here to calculate the probability distribution of a first-passage Brownian functional.
Several examples and applications of the standard Brownian functionals as well as the first-passage Brownian
functionals from physics, probability theory, astronomy and in particular from computer science are provided.

The techniques detailed in this article are valid for free Brownian motion in one dimension. However, they can be
easily generalized to study the functionals of a Brownian motion in an external potential. The external potential can
represent, e.g., a constant drift or a harmonic potential. Alternatively, the external potential can be
random, as in a disordered system. The backward Fokker-Planck approach reviewed here has been particularly useful
in calculating exactly the disorder-averaged distributions of Brownian functionals in the Sinai model [28, 39, 67].

There are several open directions for future research. For example, to the best of my knowledge, the properties of
first-passage Brownian functionals have so far not been studied in disordered systems. The techniques discussed here
could be useful in that direction. Though there have been a few studies of Brownian functionals in higher dimensions,
there are still many open problems with direct relation to experiments [12], and more studies in that direction would
be welcome. Finally, the discussion in this article is limited to simple Brownian motion, which is a Gaussian
as well as a Markov process. In many real systems, the relevant stochastic process is often non-Gaussian and/or
non-Markovian. It would certainly be interesting to study the properties of functionals of such stochastic processes.

In summary, I hope I have been able to convey to the reader the beauty and the interest underlying Brownian
'functionalogy' with its diverse applications ranging from physics and astronomy to computer science, making it a
true legacy of Albert Einstein, whose 1905 paper laid the basic foundation of this interesting subject.